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ABSTRACT 


Two stochastic models of population migration in the Peace 
District of Alberta, from 1921 to 1961, have been constructed. One 
model, utilizing the Monte Carlo method, is basically an experimental 
samplingdevice. The other is derived from the Markovian model of 
migration suggested by Rogers (1966a). The primary aim of the thesis 
is to examine the relative merits of both models in predicting population 


GUSsitriouiTonse 


The theoretical basis of the models assumes an inverse 
revaetonship between distance and intensityrof local migration. This 
relationship has been noted in many studies (Hagerstrand ios7 Marble 
and Nystuen 1963, Olsson 1965a). In the Monte Carlo model, the inverse- 
distance relationship is incorporated via Pareto expressions. The 
transition matrices of migration probabilities required by the Markovian 


model are constructed from gravity models. 


Detailed population characteristics are needed by both models. 
The necessary information was derived from the Albertan "Vital Statistics" 
reports, the "Census of Canada", and the "Census of Prairie Provinces". 
A sample questionnaire obtained the migration histories of over six 
hundred persons, and was used to calculate expected rates of external 


and internal migration from the twenty-six regions of the study area. 


Both models generate a theoretical or expected population dis- 
tribution for the study area in four different time periods. Observed 


and theoretical cumulative frequencies are compared by the one-sample 
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(ii) 
Kolmogornov-Smirnov test. The hypothesis, that any difference between 
observed and expected values is due purely to chance and is not signifi- 
cant, is either accepted or rejected depending on the maximum difference 


between the cumulative frequencies and the confidence level chosen. 


The Kolmogorov-Smirnov tests indicate that both Monte Carlo 
and Markovian models reproduce the features of the real world, within 
reasonable bounds of confidence. The Markovian model is considered more 
effective than the Monte Carlo method, since natural increase, external 
and internal migration can be incorporated in a more realistic manner. 
The calculations required by the Markovian technigue are much less 


laborious than those needed by the Monte Carlo method. 


The author adds the qualification that the inability of the 
Markovian model to incorporate additional factors, other than the 
see ene distance variable, must be overcome. In contrast, the 
Monte Carlo model modifies the inverse-distance relationship of migration 
by barriers to movement. However, the performance of the Markovian 
model, indicates that it could be applied to other forms of spatial 
interaction, which traditionally have been treated by the Monte Carlo 


method. 
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CHAPTER ONE 


THE PROBLEM 


Aim 


The object of this study is to investigate the relative merits 
of two stochastic models in generating a migration process. The Monte 
Carlo method is well established as a technique dealing with the 
diffusion of innovations and the movement of people. Recently, 
proposals pe made to apply Markov chain models to some of the 
processes traditionally dealt with by the Monte Carlo method (Olsson 
and Gale 1968). It is hoped in this study that the relative efficiency 
of the two models can be gauged by their application to migration in the 


PCE IEVEie DivsneTaLGlie Giz Millcyaeime) aeiceyil IES AIL wee) ISeil, 


Each run of the Monte Carlo model produces a simulated 
population distribution. Since the results.of the runs can vary, 
Pattse (1963) has ereeesred that the results of a series of simulation 
runs be averaged. A secondary aim of this study is to evaluate the 
significance of an averaged simulation solution. There is no similar 
problem with the analytic Markovian model, which predicts only one 
population distribution. Brown (1968b) has attempted to synthesize the 
development of models which treat spatial patterns resulting from 
spatial behaviour. These applications include models of diffusion of 
innovations (Hagerstrand 1967a), migration (Morrill 1965b), urban growth 
(Garrison 1962), settlement (Bylund 1960) and economic Deo anent 
(Curry 1967). The potential of such models WEE: Gacemonee been stressed 


but the number of studies utilising the Monte Carlo and Markov chain 


techniques is still relatively small. The performance of the Markov 


chain model in the limited field of migration has important consequences. 


The general nature of these models is emphasized. Each model 
contains facets which are worthy of further study. The correct use of 
distance-decay functions in Monte Carlo methods has aroused considerable 
controversy (Morrill 1963a). Testing of simulated results against real 
world observations raises the question of which non-parametric tests to 
use (Pitts 1963). While these problems have been encountered and dealt 
with, the accent in this thesis is on the achievement of a general 
working model rather than on suggestions of detailed modifications to the 
operation of the models. The validity of this approach is considered 
reasonable, since the application of such models has lagged behind 


suggestions about their utility. 


The process of migration in this study has been considered in 
specific terms. Explanation of population distribution and redistribution 
should take cognizance of a variety of physical, economic, social and 
political factors. Only migration—the way in which redistribution of 
population occurs—has been treated here. Conditions of natural increase 
among the settled population and external movements are constraints given 
tothe models from cutsiae (Morrill 19655, p. 101). «Local migration is 
treated internally and permitted to vary according to the principles of the 
models. The intensity of local migration is related to information, the 
supply of witch falls off with distance (Morrill and Pitts 1967, Marble 
and Mystuen 1963). This is low-level explanation. If detailed 
explanations of spatial patterns are sought, then space preferences have 


to be examined. Spatial patterns are the result of individual or collective 


comparison and evaluation of alternative spatial opportunities (Rushton 
1969). There is no reason to suppose that higher-level explanation, 


involving human perception, can not be built into the basic framework 


examined here. 


Theoretical Background 


1. Models 

The author is concerned with processes which create spatial 
Patterns. The place of the model in the Scientific Method is proven, 
though the term model has many connotations. Chorley and Haggett (1967) 
have set out in great detail the characteristics and functions of 
various types of models. The model is commonly regarded as a simplified 
Sees of reality in which certain relationships are generalized. In 
this sense,.the model is an approximation, and its value must always be 
set in regard to its degree of abstraction. Such models are educational, 
because phenomena can be visualized and comprehended in a fashion not 
otherwise possible. Insight into the process of migration is gained 
explicitly in this study through the operation of the models, and 
implicitly through the labour of fashioning them. Models are also a 
means of organizing data. They provide frameworks into which information 
can be fitted and ordered. Models act logically and account for the 
behaviour of phenomena, since explanation involves the collapsing of 
complex systems into simpler ones where interdependencies and relationships 


can be observed. 


To many geographers, the model is a means to link theory and 


observation. Models are abstractions, and the value of the model is 


4 
conditioned by the degree to which it approaches and corresponds to the 
real world. Depending on the researcher's aim, models can be used to 
make predictions about the real world that they model. In this sense, 
the model tests a hypothesis or expectation based on theoretical 
postulates. If the model and the real world agree within the bounds of 
accepted limits, then the hypothesis or the postulates can be accepted 
as valid or at least not rejected. Even if there is considerable 
difference, extensions and modifications to the model should suggest 


themselves. 


The models in this study are based on the assumption that there 
is a relationship between migration intensity and distance. If the 
generated results compare "closely" with the real world then this 
inverse-distance relationship can be considered "true". But how should 

[ 
the modelled results be tested against the real world? When does any 
difference become significant? Such questions are considered more fully 
in a later chapter. - Many studies involve reasoning about the theoretical 


postulates, while neglecting the important aspect of testing these 


assume ctones (pa tbe wlI6$ 7 pps) ll 6—-119) 


the sample inverse-distance postulate has been modified by 
space preferences, such as the operation of physical or functional 
barriers to movement. The models in this study are probabilistic rather 
efor deterministic, because they relate to situations where not all of 
the variables or relationships are known. The models treated here are 
historical-predictive, since they deal with a process operating through 


time and attempt explanation of that process. 


2. A Stochastic Viewpoint 

Until recently most work in Theoretical Geography has been 
OCrerminisciceasmeanresulteorethesimportance in®geegraphy ofthe 
optimising notions of Economics (Olsson and Gale 1968, p. 229). 
Typical of such concepts was the hypothetical "Economic Man", who had 
perfect knowledge and the capacity to make completely rational decisions. 
Curry (1966 p. 42) described the deterministic approach as the 
"specification of variables and causes of certain intensity and inter- 
action which will lead to a result differing from reality by an error 
term". If the problem of predicting a pebble's movement downstream were 
considered, or the difficulty of specifying a migrant's distination, then 
the immensity of this task become evident. All variables and causes can 
not be specified. |A deterministic model of pebble or migrant behaviour 
could be constructed but, given man's level of ignorance and the multiplic- 
ity of factors involved, such models would lack reliable predictive 
power. | ft a2 deterministic approach were adhered to, then the problems that 
geographers can deal with would be effectively limited. Attempts to 
improve the predictive power of their models have led geographers to 
conceive behaviour as probabilistic or random and to construct! 


probabilistic models. 


Examples of the probabilistic approach can be found in the 
Physical sciences (Colé and King 1968, pp. 98-99). The deterministic 
relationship of cause and effect has been rejected in Quantum Theory. 


Elemental particles are considered to obey laws of chance, and their 


lrandom behaviour has the implication of complete haphazard- 
ness. maerrovabidasticdortstochastic are preferredaterms. 


behaviour can be expressed only in terms of statistical probability. 
On the macro-scale of the universe, the Theory of Relativity also 
discredits determinism. Since position and time are not constant, 
effect possibly could precede cause. At earth scale, where the law of 
gravity holds, the precedent of Quantum Mechanics and Relativity 
becomes less evident. Yet, by following probabilistic reasoning, 


more is achieved by accepting less. 


For both pragmatic and philosophical reasons, geographers 
have been encouraged to attempt probabilistic explanations. A process 
is regarded as stochastic if it "does not proceed according to any 
immutable law but is at least partly dependent on random and chance 
PACrOrESs se barclece 1962, Da 49). BAL ents stage, £6r the purpose of 
clarity, a distinction should be made between "grand probability"— 
the consideration of degrees of uncertainty in treating behaviour— 
and Probability Theory—the essential mathematical basis of statistical 
theory”. Stochastic models are developed by reference to Probability 


Theory. 


There are many fields in Geography where the researcher can 
appreciate a viewpoint in which decisions and actions are set within a 
probability framework. Some studies have treated individual behaviour 
in probabilistic terms (Wolpert 1965), and others have dealt with group 
responses (Golledge and Brown 1967). Attempts have been made to phrase 
traditional concepts in new terms, such as Curry (1967) who suggested a 


new theory of central place systems in terms of queuing theory. 


2Some people regard the Theory of Probability as a branch of 
pure mathematics, and Statistical Theory as the epplicavion of tiis 
mathematical theory to statistical phenomena. (Bartlett 1962, p. 15). 


Probability models have been developed on m aggregative level for intra- 
urban (Malm, Olsson, and Warneryd 1966) and inter-urban systems 


(Morrill 1963b). 


Whether the problem is physical or human, spatial or temporal, 
individual or collective, retrospective or futuristic, the probability 


approach has validity (Curry 1966) . 


3. Simulation Versus Analytic Approaches 

A probabilistic approach can solve some but not all problems 
in geographic research. Among successful applications of Probability 
Theory in Geography is Dacey's work on the analysis of point patterns 
(1966). Yet, such efforts are merit aK descriptive rather than 
explanatory. The imprecise nature and great number of variables in 
geographic problems has frequently resulted in models that are over- 
simplified. Analysis through the use of means and dispersions, 
regression, and correlation approaches have been employed in many 
studies. Unfortunately, standard statistical techniques need not provide 
required answers. When over-simplified models are used, little more than 


a crude comparison with actual data may be justified. 


The difficulty of deriving adequate analytic solutions has 
encouraged many researchers to accept methods of simulation in place of 
more formal mathematical theory. Of the former, the Monte Carlo method 
is a widely used technique. Its characteristics will be examined in 
Chapter Two. Simulation approaches, providing methods of analysis where 
no others are possible, are not always satisfactory because of the 


difficulty of evaluating the results. In contrast, the Markov chain 


model presented in Chapter Three is an analytic solution. 


Method 


To consider temporal as well as spatial characteristics, the 
models were applied to four ten-year periods from 1921 to 
1961. The procedure consisted of: 
1. collection of appropriate information and its transformation; 
2. creation of Monte eae and Markov chain models; 
35 operation of the models; 


4. analysis and evaluation of the results. 


Leathe. Derivation) of Information 

The Peace River study area was divided into twenty-six regions, 
each of which was an amalgam of census sub-divisions. For each region, 
in each time period, information was collected on population numbers 
and density, natural increase, number of people available for internal 
migration, migration characteristics, and distances between all regions. 


In addition, certain barriers to movement between regions were determined. 


Zee creation of the Models 

The derivation of the models is treated in detail in Chapters 
fwo. and Three, ne this point, it is worthwhile to consider some 
peculiar difficulties. The redistribution of population is only one 
aspect of a general stochastic model of a migration process. Assume a 
certain population in a region at a time t . In the following period, 
whatever its duration, some people will leave, some will die, others 


Wiwldvarrive or be born. The composition and size of the region's 
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population will change. The elements of the change, births, deaths, 
and migration, occur together continuously. Any operational procedure 
must first segregate change into discrete time periods. Secondly, the 
elements of change must be treated separately. Conceivably, a model 
could be designed to avoid these simplications, but its complexity and 
the amount of data required would be prohibitive. The only feasible 
alternative is to design operational modes which are nearer to, than 
further from, the truth. Consequently, the Monte Carlo model segregates 
natural increase, external and internal movements. In this respect, 
the Markov chain model is more flexible, although it too requires the 
different components to be introduced separately. This point will be 


taken up again during consideration of the models' design. 


3. Operation of the Models 

The objective of both models was to create a new population 
distribution at the end of a time period, given the distribution at the 
beginning of the period and certain operating procedures. Natural 
increase and external movements are taken as constants, but local 
Migration varies according to the principle of the model. The latter 
depends upon the probability of interaction between regions, which is 
derived theoretically. Since the number of people moving in any one 
time period was great, and the probability of interaction extremely 


complex, the models are designed to be run by computer®. 


4. Analysis and Evaluation of Results 


The testing of generated results against actual observations is 


3There are a number of programmes relevant to these studies 
including Marble (1964) and Pitts (1965 and 1967). The programmes 
necessary for this study were prepared by Dianne Rose. 
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a crucial aspect of all models. Unless this procedure is adequate, the 
model will lose any claim that it has to objectivity. The assumption is 
made that the real world has only a probability existence. It is only 

one of a possible series of results. Consequently, there is no 
information available on the nature of the underlying distribution from 
which it comes. Since no assumptions, such as normality, can be made 
about this underlying distribution, non-parametric tests of correspondence 


have to be used. 


The results of the models have been tested, for each time 


period, against each other, and against the real-world situation. 


The Study Area 


Theesikwdy area is the core of the Peace River Dustrict of 
Alberta (Figure 1)-— Settlement in°its northern and southern districts 
have been excluded because of the small size of their population. The 
operation of the models was observed through time. The Peace is a 
laboratory in which the process of frontier settlement and population 
redistribution can be examined almost from the beginning of settlement. 
The area may be compared to a newly emerged volcanic island, which goes 
through the complete stages of growth from pioneers to associations. The 


study area is described in greater’ detail*ain’Chapter Four. 


Data Base 


Data sources do not compare well with Sweden where many 


Monte Carlo models have been applied (Hagerstrand 1957, Morrill 1965b). 
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Complete disaggregated data on population and other characteristics 
are absent. Better data would result if a shorter, more recent time 
period was taken. Yet, investigation of the performance of the models 


through time is considered important. 


Tae Wsttten Sources 


The written sources of information are the Dominion Bureau of 
Statistics censuses for Canada and the Prairie Provinces", and the 
Albertan "Vital Statistics" Reports. The study area is composed of 
census divisions fifteen and sixteen°. Births and deaths are recorded 
in the census for divisions. Consequently, the net migration gain or 


loss for the entire study area is obtained by the residual method (Isard 


1960, DO. 54-55); 
-P ) -N Cpa) 


where My is the net migration over the time period, PL the 


population at the beginning of the time period, El the population 
at the end of the time period, and Ng the natural increase over the 


time period. 


The same procedure was used for the regions over the period 
1932 £0 .1944, since births and deaths are given at the census sub- 
division level in the Albertan "Vital Statistics" for these years (but 


not before or after). For the rest of the time, formula (1.2), was used, 


The Separate Prairie Provinces Census was discontinued after 
1946. 


| 


Safter 1951 the two divisions were combined. 
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Ma = (2 


9 re - (OP ,+6P 4) (12) 


CEO 


where My is the net migration over the period, Ee is the population 


at the beginning of the period, is the population at the end of 


P 

t+9 
the period, a is the birth rate over the period, 8 is the death rate 
over the period, and Py is the population of the region to which birth 
and death rates are applied. Crude birth and death rates were used 


Since no breakdown by age and sex is given for census sub-division. 


Resultant figures for regions are reasonable approximations 
of the true situation. Despite their limitations, the data are 


adequate for the general models in this study. 


2. The Sample Questionnaire 

The number of potential migrants in each region was derived 
from information gained from sampling with a questionnaire. Many of 
the original settlers or their first generation descendants still 
survive. This was an important consideration in deciding to sample 


the population. 


The sample population was based on the Peace River Telephone 
Directory which listed 12,235 private households. One hundred and 
fifty were drawn completely at random, with a supplementary Lust sor 
fifty. A sample size of one hundred and fifty was chosen over the 
conventional figure of one hundred, since, at one stage the transition 
matrices of migration for the twenty-six regions were to be eee ees 


from the sample. To ensure complete coverage for all the six hundred 


and seventy-six cells, the larger sample size was taken. In practice, 


14 
the transition matrices were estimated by a gravity model, not 


empirically from sample information. 


Interviews on the migration history of individuals in the 
households were conducted by telephone and mail. In this way a 983% 
response was achieved. Each pemeeneila was sent a copy of the questions 
to be asked (see Appendix D), and informed that a telephone call would 
nO LOW ome questionnaires were completed and returned by mail, but 
the bulk of the interviews were carried out over the phone. The high 
degree of success was obtained by use of the supplementary list when 
individuals had moved, were on vacation, or no response was achieved 
after several telephone calls. Since migration was being considered back 
to 1921, individuals were asked to provide information, not only about 
themselves and their spouses, but also their parents and in-laws. A 
complete spectrum of knowledge of the movements of six hundred and 


fourteen persons over the period 192igeco, 1961 was achieved: 


The sample did not include interviews with people who lived 
in the Peace prior to the date of interview and who left for other 
regions of the world. Similarly, people who may have lived in the Peace 
for several decades, for example, and then died without leaving offspring 
in the Peace River, could not be included. Furthermore, some bias may 
have been introduced into the sample because interviews were arranged 
only with those families which were listed in the telephone directory. 
The sampling procedure might introduce a bias, if people with telephones 
were more mobile than those without. On the other hand, there is a 


possibility that people with telephones might be better adjusted to the 
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local area, and less likely to migrate. These forms of bias could not 


be more adequately determined in this study. 


Finally, the size of the Peace Region might vary from that of 
areas used by other researchers. It is important that the results of 
this study be interpreted within the general size and situation of the 
Peace Region. The size of the sub-regions was determined by the desire 
to reduce the total number of census subdivisions to a manageable 
number. The aggregation of sub-units into twenty-six larger districts 


was determined on the basis of ethnic, topographic, and settlement 


characteristics of the census sub-divisions of the Peace District. 


In conclusion, the primary aim of the study is to investigate 
the relative advantages of the Monte Carlo method and the Markov chain 
model. The results achieved by the models for each of the four decades 
over the period 1921 to 1961 are tested against the actual population 


distribution in the Peace River study area. 


The history, nature, and detailed construction of the models 
are given in Chapters Two and Three. Chapter Four deals with the 
operation of the models, while predicted values and observations are 
tested against each other in Chapter Five. The ress of the tests 


are analysed and evaluated in the final chapter. 


CHAPTER TWO 


THE MONTE CARLO METHOD 


This section describes the history, potential, and construction 
of simulation models. The terms, simulation and model, could be 
regarded as synonymous, though in practice, the former refers to a 
specific group of models, of which Monte Carlo is the best known. 
Monte Carlo simulation treats situations where not all the factors and 
relationships are known. Hence, a probability concept is introduced. 
Basically, Monte Carlo methods are concerned with sampling experiments 


on random numbers! , 


History of Monte Carlo Methods , 


The term Monte Carlo, which formally emerged in 1944, has 
various aspects., Its significance to a mathematician is different to 
EnAvsOLsasDaysicals Seientist OY a geographer. Partly, this asvdue to 


Enesmeciod vss tirectold origin. (Tocher 1963, p. 2). 


The oldest application of Monte Carlo methods lies in the 
sampling experiments of statisticians. While there were numerous 


sampling experiments in the second half of the 19th century, modern 


lconsider, for example, the problem of estimating the area of 
an irregular region. One method would employ a sampling experiment. 
The irregular region is surrounded by a region of known area. One 
hundred points are located by a random procedure over the known region. 
The proportion of points which fall in the irregular region provide an 
estimate of its size as a proportion of the known region's area. 


2There were numerous experiments of throwing a needle ina 
haphazard fashion on to a board ruled with parallel straight lines. The 
value of tm was infered from the number of intersections between needle 
and lines. 


NEY 
sampling experiments began with the work of "Student" (W.E. Gossett). 
In 1908 he used sampling to bolster his faith in the "t distribution" 
which he had derived by theoretical analysis. ‘This work was taken up 


by Pearson, Galton, and others (Tocher 1963, pp. 43-44). 


The object of sampling experiments was to determine the amount 
and form of variability that some estimate of a parameter would have in 
repeated sampling. Although, mathematical techniques have developed to 
great levels of sophistication, the value of sampling experiments in 
mathematical statistics still remains, especially with the availability 


of high speed computers to perform laborious calculations. 


The second origin of the method was created by the demands of 
applied mathematicians and physicists. In connection with work on the 
atomic bomb during World War II, von Neumann and Ulam utilized the 
technique to simulate the probabilistic nature of neutron diffusion in 
fissionable material. Since then, the technique has been employed to 
deal with many physical systems where particles behave in a partly 
regular, partly irregular fashion (Meyer 1956). The Monte Carlo method 
permits a participant to play a game of chance, the rules of the game 
allowing deterministic and stochastic features of the process to be 
reproduced. By considering the results of a large number Re erieiee 
the distribution of particles or the number of particles which escaped 
through a medium at the end of a time period can be estimated. Similar 
techniques have been used to determine when particles are going to 
Collide, the number Or resultant particles and Subsequent collisions. 
Monte Carlo, in this sense, is a "device for studying an artificial 


stochastic model of a physical or mathematical process", and involves the 
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use of sampling within a probabilistic framework (Meyer 1956, p. 5). 


Physicists have extended the technique to deal with 
deterministic problems (Hammersley and Handscomb 19%4, p. 7). Partial 
differential equations arising in the abstraction of physical processes 
can not always be solved by conventional analytic methods. It is 
possible to find a probability analogue to the deterministic equation. 
The Monte Carlo method enables the statistics of the analogue to be 
computed by a sampling experiment. An answer is thus provided for the 


deterministic equation. 


The development of Monte Carlo techniques in Operations 
Research, the third origin, is most relevant to Geography. Operations 
Research seeks to apply the Scientific Method to military, managerial, 
and industrial problems. In keeping with many sciences, it sets up 
models to ete insight into processes considered. Unlike Physical 
Science, but similar to Geography, Operations Research studies a great 
variety of phenomena. Situations arise which, when incorporated into 
models, can not be solved adequately by normal mathematical methods. 
Many researchers, like applied mathematicians, have turned to sampling 
experiments to achieve results where no other methods are possible 


(Harling 1953). 


Simulation techniques have passed via Operations Research into 
the Social and Behavioural Sciences, including Geography. There, too, 


they offer an approach where no others seem likely. 


Owing to the multiple origins and use of Monte Carlo, the 
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19 
technique can be classified in different ways. The simplest system is 
a division into "probabilistic" and "deterministic" Monte Carlo 
(Hammersley and Handscomb 1964, pp. 1-12). The former serves as a 
Gevice sto mscugy an artifievalsstocnastic model of a physical or 
mathematical process. Random numbers are chosen in such a way that 
they directly simulate the random events in a stochastic process. 

The desired solution is inferred from the behaviour of the random 
numbers. In the simplest and crudest "direct" simulations, no more is 
involved than transference of the stochastic features of the real 

world into the model. In more sophisticated efforts only those features 
of the real system are incorporated which affect the solution of the 
model. In other words, care is taken to ensure that the model neither 
oversimplifies the system, nor incorporates so many elements that it 


becomes unwieldy (Harling 1958, p. 307). 


Deterministic Monte Carlo involves the creation of probability 
analogues to solve deterministic problems. In this context, Monte Carlo 
is used to fill gaps in theoretical mathematics or deterministic 


reasoning. 


Since most "geographic" processes are regarded as stochastic, 
Monte Carlo techniques in Geography typically are of the probabilistic 
type. Bunge (1969) has suggested that simulation and conventional 
analytic techniques could be combined. A form of "deterministic" Monte 
Carlo could be employed in which, for example, parameters of probability 
functions could be estimated. As yet, this is an unexplored avenue of 


research. 
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The Monte Carlo Method in Geography 


Much of the credit for introducing the method into Geography 
must be given to Hagerstrand. The impact of his book, published in 
1953, has been so considerable, despite a time lag caused by lack of 
translation into English (Hagerstrand 1967a), that few fields 
relevant to Geography are now untouched. Monte Carlo methods have been 
applied to the diffusion of innovations (Hagerstrand 1965), migration 
processes (Morrill 1965b), growth of urban settlement (Morrill 1960), 
and intra-urban growth and differentiation (Malm, Olsson, and Warneryd 
1966). The popularity of the technique has led to efforts to develop a 
formal basis (Brown 1968a). As Pred suggested (1967), applications 
range from analysis of past distributions, future projections, and 


treatment of static distributions. 


Essentially, these methods have been employed to deal with 
problems where no satisfactory mathematical solutions are feasible. 
This is the case>when the prediction of a migrant's move is considered. 
A deterministic approach must be replaced by ’a probabilistic one, if a 
greater degree of precision is to be achieved. Admittedly, the 
transformations of space suggested by Tobler and Bunge (Bunge 1969) 
may reduce the necessity of so many random variables in model development, 
though it is unlikely that such transformations could make a random 
Ane completely feasible. Equally, with many problems in Geography, 
conventional Probability Theory can be applied only to crude and over- 


Simplified models. 


The last comment can be appreciated by considering a simple 


analogy. Suppose that it is desired to know the chances that the 
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24 
number of hits in ten throws at a coconut shy is even. This could be 
achieved by taking a series of ten thousand throws at the shy. On the 
other hand, conventional probability theory can provide an answer 
without the necessity of experimenting (Schreider 1966, p. 3). However, 
if it were desired to estimate that the chances of a given game of 
partience coming out were greater than even, then there is no way that 
Probability Theory can provide a reliable answer. The number of 
possibilities for any one trial is too great. The only way to achieve 
an estimate is to play-the game again and again (King 1951, p. 13). This 
applies to all processes, including migration, where the phenomena 


Cae varyecreatly., 


A crude deterministic model of migration, such as the gravity 
model, could be derived, or probability functions could be developed. 
Thetlevel “of precision is low because of oversimplification. It is here 
that the value of the experimental Monte Carlo method can be appreciated. 
Underlying theoretical considerations, such as the inverse-distance 
rélationship, can be incorporated. Yet, the allocation of random numbers 
to random events permits the element of chance to be introduced as well, 
so that the researcher's level of ignorance and internal irrational 
behaviour of the system can be appreciated. A simple example will clarify 


these points. 


Lf the intensity of flows or interactions between points is 
assumed to fall off with increasing distance, then this relationship may 


be expressed in a simple statement, such as the gravity model. 
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ie, =19 iS CA) 


where ve is the interaction between place i and place j, Ps 
and oe are the populations of i and j respectively, me is the 


distance between i and j, and g and b are empirically derived 


constants. 


The gravity model is a deterministic statement. It can be 


converted to a probabilistic expression, such as the Pareto function, 


Cc 
p(d) = a5 2a 74) 


where p(d) is the probability of interaction between two points, D 
is the distance between, and c and b are empirically derived 
constants. Of course, several such expressions could be developed and 


their relative merits argued (Morrill 1963a). 


This is the starting point of the simulation. In some cases, 
equation (2.2) is fed directly into the model (Pitts 1967), but more often 
a special probability matrix is employed (Figure 2a). Conventionally, 
this is a five by five grid, where each cell is 25 square kilometres 
(Morrill and Pitts 1967). The operational procedure superimposes the 
grid over each potential migrant in turn, so that the migrant's location 
becomes cell 13 (Figure 2a), the centre of the grid. The probability 
of movement to any other cell is given by the figures in each cell 


(Higures2>) ) whiehitsum Comunity. )elheserprobabality figures are derived 
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Figure 2a 
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Random Number Targets 
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Figure 2b 


Probability of Interaction 


Figure 2d 


Destinations of Migrants 


MONTE CARLO PROBABILITY GRID 


Malm, Olsson, and Warneryd (1966) 
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24 
from expression (2.2). Since most probability functions tend to give 
inaccurate estimates for close-in distances (Morrill and Pitts 1967, 
p. 415), the value of cell 13 is customarily derived from empirical 
observations. The other probabilities are then adjusted accordingly. 
It should be apparent that the cell values run symmetrically about the 
eps che coat the voluccmrOmicells 918). 147 8, and. 12) are similar, 


and ssopon, (Fagure 2b) . 


16 Pareto function, or some Similar expression, illustrate no 
more than a crude relationship. Each migration can be regarded as a 
random event and can be allocated a random number. If the probabilities 
in the matrix are converted to a tally of discrete numbers (Figure 2c), 
a range of digits is derived for each cell. Since there are more numbers 
ioece) we lLoethaneineanyeotner) cell, the possibility of selecting a 
random number in that range is greatest. Yet, because numbers are drawn 
completely at random, there is always the possibility of a less likely 
destination being chosen. Assume that five migrants are to be distributed 
by the illustrated system (Figure 2c), that the five draw numbers 200, 
S01 we5 5) 900, and.) .ooG, sand hence move to locations 7,920, 25, and 


21 respectively (Figure 2d). 


By using similar techniques physicists have estimated the 
distribution of elemental particles. Similarly, in the migration process, 
the method permits the distribution of migrants to be estimated at the 
end of a time period. Each migrant is assumed to behave in a partly 
eveguilar, partly irregular fashion, the former through the distance-decay 
function or some similar theoretical postulate, and the latter by the 


allocation of random numbers. 
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This is an extremely simple illustration. Other considerations 
need vo serreised, and will be discussed in the next section. In summary, 


simulation provides three basic advantages to geographic research. 


1. Insight into the Process Concerned 

The experimental running of a process can often be heuristic. 
Simulations of innovation diffusion have suggested refinements to the 
idea of a diffusion wave, such as the filtering down through the urban 
hierarchy (Hudson 1969b). Such simulations, operating like time-lapse 
photography, have cen the continuing importance of the core area 
(Hagerettands L965 p S56) eine ability of the technique to "act as a 
framework for the organization of data suggests that this should be its 


main function. 


2. Pragmatism 

A number of studies in Geography and Urban and Regional 
Planning emphasize the value of the technique as a tool to envisage the 
impact of change upon a situation such as urban growth. (Goldberg 1968; 
Recht 1968; Walter 1968). Morrill (1965c) has used the method to 
suggest the expansion of the Negro Ghetto. Elsewhere, the technique 
has been used to illustrate the effect of changing urban renewal inputs 


upon urban growth®, 


34. Testing of Hypotheses 
If probabilities are derived on the basis of certain postulates, 


then the simulated distribution should reflect the theoretical postulates. 


3For example, note "Community Renewal Programming: A San 
Francisco Case Study", a report prepared by Arthur Little Inc. in 1966. 
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If the results are tested against the real world, then the degree of 
correspondence should indicate how accurate the postulates are. In 
practice, serious difficulties arise with regard to the testing of 
simulated patterns, so that the heuristic and pragmatic qualities are 
often claimed as the technique's chief attributes. Chapter Five treats 


the difficulty of deriving adequate tests in detail. 


Some Relevant Considerations in the Construction of a Monte Carlo Model 


The chief virtue of the Monte Carlo method is that it permits 
ignorance to become a strength. While less vigorous than mathematical 
approaches, it offers a solution where no others are, as yet, 


conceivable. There are, however, several operational pitfalls. 


1. Number of Observations 

Since the method is based on the selection of random events, 
one simulation can vary greatly from the next. The implications of 
this for evaluation will be discussed below, but this characteristic 
also implies that a large number of independent events must be 
necessary. It seems appropriate only to apply the technique to problems 
where there are a large number of events. A large number of people 


move at any one time during migration in the Peace. 


2. Selection of pe meee ari Sachi oe 

Garrison (1962) suggested that the application of the simulation 
technique should be carefully chosen, lest the researcher's original 
ignorance be reproduced. If the unlikely task of simulating all possible 
Pera cioneni ps in the real world succeeded, then all that would have been 


produced is a "black-box" model. This would provide no insight into how 
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27 
the processes function. Instead, it is necessary to concentrate on 
certain relationships within a simplified framework, and thereby, through 
continual modification and verification, to build up knowledge. Thus, 

eimutations.. «Coes Ot ask £or exact replication, tor that 
would destroy or obscure the generality of the model and 
the theory on which it is based. Of course simulation will 
Mer lect tte rules or the model itself, but owing to randomness, 
Oey -reccom Of ~Ccnolce, adetange Of Variadi li tyemay, Descxpected. 
In general, we wish a simulation to be enough like the real 
world that both could have been produced from the same process 
(Mower 965o55 0 OW). 
3. Theoretical Significance of Probabilities 
Closely associated with the last point is the derivation of 
probabilities of random events. The real world could conceivably be 
reproduced exactly in a simulation by carefully modifying the probabilities 
through a feedback process. Equally, some set of probabilities could 
be derived by nothing more than a casual examination of the process. 
Unless theoretical meaning can be given to the probabilities, Monte 
Carlo methods become crude direct simulations of the process concerned. 
Little or no insight into the process is gained. Hagerstrand (1965) 
Was able to provide the probability of innovation diffusion or migration 
movement with theoretical significance, by relating the interaction to a 


distance-decay function expressed in the Mean Information Field concept 


(Marble and Nystuen 1963). 


4. Dynamic Rather Than Static Situations 

These techniques are frequently employed to treat dynamic or 
Changing situations. “Such is the case with historical-predictive models. 
Consequently, probabilities should not be expected to remain constant 


over time, especially when human rather than physical processes are 
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28 
treated. Changing inputs or feedback will result in changes in the system. 
Conventional approaches derive different sets of probabilities for 
different time periods, though of course, time changes are continuous 


euavel inene Glilfsreraeiee)q 


There is also the problem of equating a simulation run, 
or a generation in a simulation, with real time. In diffusion studies 
the usual method relates simulation to time by means of a linear- 
logistic curve (Dodd 1955), since the number of adopters in a population 
follows a double asymptotic or S-shaped curve through timet. 
Intuitively, it seems more difficult to apply this concept to the 


migration process. 


5. he Significance of a Simulation Run 

The problem of gauging how well the theoretically simulated 
pattern corresponds-with the real world is closely related to that of 
judging how well one simulation fits the theoretical distribution. 
Does one simulation provide a good indication of the distribution 
based on theoretical postulates, if it is known that simulated results 
vary widely? If not, then how many simulations are necessary? If an 


"averaged" solution were derived, would it be valid? 


The simulation distribution, a statistic which estimates a 
population parameter, represents one pass through an unknown, underlying 


population. As with all sample statistics, the question is asked, 


‘The number of adopters is at first small, then increases 
rapidly, and finally falls off as saturation of the population is 
reached. 
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"how representative is the statistic of all observations that might be 
made?" Good sampling practice involves the use of variance-reducing 
techniques, the specification of an adequate size of sample, and an 
estimate which indicates how likely the sample is representative 
(Hammersley and Handscomb 1964, p. 5). Unfortunately, the nature of 
the irregular, simulation statistic and lack of knowledge of the 


underlying population do not permit the use of conventional sampling 


practice. 


Error terms can not be derived. In some problems in Operations 
Research, variance-reducing techniques are employed, whereby certain 
transformations and alterations to the original data are carried out to 
emereasewtile, EEULN Or the result (Tocher 196370. Ll6) Application 
of such techniques has not been possible in the cases where the method 
has been employed in geographic eEA Although alteration of 
probabilities by feedback could be envisaged as a form of variance- 
reduction, the theoretical and empirical significance of the data might 


be ruined. 


In the absence of variance-reducing techniques, a good 
approximation of the theoretical population is achieved by increasing 
the size of the sample (Tocher 1963, pp. 2-3). Since the accuracy of 
estimates for the parameters of the unknown distribution is a function 
of the number of observations, computer programmes have been written 
to “average-out" a large number of simulation results (Marble and 
Bowlby 1968). The suggestion is that the "averaged" solution more 
closely approximates the ultimate or "omega" distribution than one 


simulation (Pitts 1963). Yet, how many simulations must be averaged 
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before a good approximation is derived? Maass eteal. (1962 en 
considering the design of a water resource system, have suggested a 
close fit between the ultimate distribution and a simulated solution 


after two hundred runs out of a possible Tore simulations? 


This'is by no means a Satisfactory solution, nor is it 
feasible in all cases. Certainly, if simulation runs are regarded through 
time, an "averaged" solution may approximate the omega distribution. 
Much depends on the point of view of the researcher, and the use to which 
a puts his work. Tf history is seen in terms of degrees of events 
rather than particular events (Curry 1966, p. 46), then this solution 


appears adequate®. 


On the other hand, if the results of a simulation are viewed 
spatially, then the significance of an "averaged" solution is difficult 
to envisage. Morrill (1965b, p. 172) suggested that "we may not average 
unigue patterns". A pattern could be simulated over and over again, but 
if the results are averaged, all that is achieved is a smooth spatial 
surface. For example, if two points on a transportation system are 
taken, then the number of simulated links between them can not be 
averaged out. The Monte Carlo simulation model employed in this study 
produces a new population distribution at the end of each time period. 
THis Cistrivution, Like that ot the real world, can be regarded as one 
sample from many possible solutions. Since discrete population units 
are treated oatratner than network links) an "averaged" solutions seems 


“tn this case, the number of variables is very much smaller 
(fawn) tie case Cf migratzon i the Peace District. 


Ore degrees of events are accepted, then real history becomes 
only one outcome in a whole series of possible outcomes. 
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6.) Jnverse-Distance Relationship 

The distance-decay relationship has been summed up by Zipf's 
"Principle of Least Effort" (1949) and Bunge's (1969) contention that 
things locate as close to each other as possible. An early deterministic 
summary of the relationship was the gravity model. More recently the 
concept has been embodied by distance-decay functions, such as the 


exponential, Pareto, er log-normal. 


The inverse-distance relationship provides a basis for many 
studies including diffusion (Hagerstrand 1967a), migration (Morrill 1965b), 
urban growth (Malm, Olsson, and Warneryd 1966), and other forms of 
Piberectton, suchmaseconcumer Dehaviour (Really 1031) — For example, in 
the absence of adequate data on social contacts, Hagerstrand (1957) 
fell back on surrogates such as local migration and telephone calls. 
Basically, Hagerstrand assumed that the spread of an idea occurs 
largely as a result of personal contacts. If the spread is governed 
by information, then there is clearly a relationship between information 
and distance. Time, cost, and effort of interaction become greater in 


absolute terms as distance increases. 


Migration falls off with distance (Hagerstrand 1957). Further- 
more, Migration 1S a function of information, particularly when the 
influence of relatives is considered. Recently, the focus has shifted 
from the distance-decay relationship. The personal evaluation of 
spatial opportunities has been proposed as a more critical concept 


(Rushton 1969). However, the importance of distance should not be too 
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easily discarded. Individual perception of distance would seem to be 
a crucial aspect of any personal evaluation of spatial opportunities. 
A more formal appreciation of distance is required, such as the 
derivation of some functional distance, rather than straight line 


measurement. 


7. The Mean Information Field 

The relationship between information and distance is best 
appreciated through the conception of the Mean Information Field 
(Morrill and Pitts 1967). aimtormation Gseatcritérion by which. one 
individual will tell another of an idea, or the way in which he will 
migrate’. An individual's decision will be one of a variety of 
possibilities. Instead of moving from place i to place j, he might 
move to place k. Hagerstrand (1957) assumed that all individual 
decisions are unique, yet have an underlying empirical regularity based 
on the inverse-distance relationship. This empirical regularity can be 
derived if individual behaviour is aggregated. The individual 
information or interaction field can be traced (Figure 3a), and a Mean 
Information Field created by collecting the origins to a single point 
(Figure 3b). Each information field,.although unique, represents a 
chance variant of a general theme. If the variants are collected 
together, then the underlying regularity becomes apparent. In a given 
region, observation of the behaviour of a large number of individuals 


can lead to a frequency count of their movements or contacts in certain 


distances or directions. 


7tnformation, here, is being viewed ian a Jimited sense. For a 
more formal statement of the Theory of Information, see Weaver (1966) 
and Rapaport (1966). 
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an lndavaduals inkonrmaclon Hvela 


b. Mean Information Field 


FIGURE 3 


MEAN INFORMATION FIELD 
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Having derived the Mean Information Field from individual 
behaviour, the procedure can be reversed, and the moves or Panties Of 
a typical individual can be predicted in a deterministic manner, such 
that one quarter of all migrants may be said to move a distance less 
than ten miles. This deterministic statement may be converted to a 
probabilistic expression, such as the distance-decay function. In the 
simulation, the behaviour of typical migrants can be varied by the 
utilization of a random number selection procedure. The latter permits 
irrational behaviour, so eube deviations from the systematic regularity, 


in keeping with the real world situation, can be introduced. 


In actual simulations, one information field for all regions 
in the study area is uncommon. Conventionally, a different field for 
each region is employed, since adjustments are normally made for 


varying population and area sizes. 


8. Distance-Decay Functions 
In any model of migration or interaction, the empirical 

regularity of intensity of contacts varying inversely with distance can 
Descumed sup byercteting a tunction (Morrill 19634), If a graph is used, 
tienstais takes the form-ot fitting a curve to the observations. These 
functions can be treated in two ways, 

a) £(d), the probability or frequency of migration to a band 
area, Which is distance d.» from some origin; 

b) p(d), the probability of migration between unit areas which are 
distance d apart. | 


The form chosen depends on the results required or the technique used. 


In considering interregional migration, the p(d) form seems appropriate. 
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Phew teeiniugienrol Lrrling=a cumecron to "tabulated data ts "treated in 
detail in Appendix A. Reference to the graph (Figure 4) indicates that 
the number of contacts per unit area falls off with increasing distance 


away from a point of origin. 


A variety of functions has been proposed by Morrill (1963a) 
and Marble and Nystuen (1963). The more complex Gamma, Gaussian, 
Fourier, and Quadratic varieties provide certain theoretical advantages, 
which are often outweighed DYStoerOlihicudty) Ol fi eting them. ine 
easiest, the log-normal, exponential, and the Pareto and Pareto- 
exponential forms are most frequently used. They provide fairly good 
results and can be given a linear form by log transformation, so that 


the parameters can be estimated by least-squares techniques (Appendix A). 


Although, the fitting ofsa function is essentially practical, 
each function carries its own assumptions about the underlying data. 
For instance, the normal distribution would appear adequate if people 
travelled completely randomly over all distances and in all directions. 
Tiepractrce,; tiere 1S a Lendency £or moves to =be™ concentrated over Snore 


and very long distances. The log-normal form, 


p(d) = OR (2.3) 


and the exponential 


p(d) = ce | (2.4) 
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where, d is the distance between points, c and b are empirically 
derived constants, and e is the base of the natural logarithm, 

have been shown to underestimate close-in movements, whereas the Pareto 
function overestimates such movements (Malm, Olsson, and Warneryd 1966, 


p. 10). Sometimes Pareto-exponential forms have been utilized, 


p(d)s=hcd <c (2.5) 


but the most commonly used expression is the Pareto (2.6), adjusted to 


take area into consideration, 


ine) 5 (206) 


where p(d) is the probability of contact between regions, d is the 
distance between regions’ centres, A is the area of the destination 
region, and c and b are empirically derived constants. The value 

of the Pareto function has been demonstrated in many cases (Hagerstrand 
1957, p. 126), and the exponent b enables results from different 
studies to be directly compared (Olsson 1965a, pp. 25-26). The exponent 
is significant in that it reflects the gradient or extent of the Mean 


Information Field. 


The distance-decay function can be employed directly in its 
mathematical form in a model, or utilized in a special probability grid. 
tf the latter technique is used, then the poor predic¢tion of close-in 


contacts is eliminated by taking an empirical reading for the cell 13 
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FIGURE 4 


PARETO FUNCTIONS 
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location. With the former, some modification to the function can be 


made at short distances8, 


9. Space Preferences 

The assumption behind a distance-decay function is that there 
is a symmetrical fall-off in intensity of contacts away from some source. 
In practice, this is rarely the case. Because density of population 
is considered important, probabilities are weighted by the population of 


Ene SOUR Cie CSsieiinehewer) euaeel, Wools aS} cl CligireleireiedSinie! Oe ela 


deterministic gravity model as well. 


Other factors, including physical layout, socio-economic 
components of population, and communication links can disturb the symmetry 
(Malm,’ Olsson, enamel nervarlooo, pp... ll-L2)ee lt Vs) ditracult to 
integrate such variables into functions, and still maintain the 


theoretical significance of the function. 


One alternative is a suitable transformation of space to obtain 
a functionally based distance. In the absence of such a value, however, 
Space preferences have been dealt with fairly simply in most models. 
Physical obstacles have been introduced as barriers, which reduce the 
intensity of contacts or the number of moves. Other attempts have created 
functional barriers (Yuill 1964), while multiple tellings have been 
employed as measures of psychological resistance, reflecting socio- 


economic chasnacteristics (Pitts 1963, p. 114). 


Sone example of modification to the Pareto function over 
short distances is Pitt's (1967) MIFCAL programme. 
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tnygeneral, alterations to the basic) probability of interaction 


between regions can be summed up by (Pitts 1963, p. 114) as 


1 eye dts cree aemeccmte-fmich bar 'o) (2,7) 


where a is the ‘modified interaction between regions j tand i, 
Ps is the smoothed probability between j and i, He is the 
Popuvation of region? %, 7 and. Bis a barrier equal to 0.0 if the 


barrier is absolute, and 1.0 if non-operative). 


Marble and Nystuen (eesece 108) have shown, that in the case 
of household travel movements in Cedar Rapids, there is a strong 
directional bias towards the Central Business District. If this is the 
case in other studies, then information fields should be modified to 


include directional bias. 


Conventionally, the diffusional process is seen as a wave 
moving)out from a centre(s). The idea of filtering down an urban 
hierarchy is introduced as a modification (Hudson 1969b). To some 
extent, this can be envisaged within the migration process as well. 


> 


The Construction of the Monte Carlo Model 


The proposed procedure is illustrated by means of the flow 
Gnart (Figure 5). Whe inputs include the adjusted population Ap of 
each region at time t , the number of potential migrants mj from 


each region during t to t+ 6 _, the area of each region, a subroutine 
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which calculates the distance between regions, effective barriers, 
the number of generations and simulations required, and a routine 


which averages out the results of a number of simulations. 


The preceding section has treated in detail most of the 
features of the model. However, discussion of some of these features 


should be expanded. 


1. The Adjusted Population: Operational Mode 

The necessity of treating time in discrete periods and 
segregation of the elements of population change has already been 
mentioned. Since the study is essentially concerned with internal or 
local redistribution of population, it seems reasonable to accept natural 
increase and external migration as contraints to the model given from 
outside (Morrill 1965b,p. 101). How are such constraints fitted into the 
model? The author wishes to emphasize again that no operational 


procedure can be completely accurate. 


Por cach eregionein each) tame period, §t to. t+ 8 7 ithe 
estimated natural increase is added to the Bree eee AteLine. «tunmeluts 
new value is then adjusted to take external movements during the time 
period into consideration. The adjusted population Ap becomes the 
populationedteregi ons] pduring \timerperiod;, stogtosttet)S i; Erom which 
a certain number of migrants mj will move. Thus, 


Ap = (P,+N,) + E (2.8) 
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where Ap is the adjusted population, a is the population at the 
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beginning of the period, N is the natural increase over the period, 


s) 
and Ey techie netlextermal gain or loss. 
Migration data calculated from Census and Vital Statistics are 
net figures. The net figure for each region contains both external and 


internal components. These were segregated by using the sample as a 
microcosm of the settlement process in the Peace. A history of 
population growth and distribution was created by using the sample 
individuals. From this, the external and internal components of migration 
in each time period, for each region were estimated. Thus, actual 
population figures indicating a net migration total are equivalent to 
(External Gain + Internal Gain) ~ (External Loss + Internal Loss). If 

the relative percentages of movements are known from the sample, then the 


met external gain or loss can be estimated. This is equivalent to Ey 5 


No account is taken of internal moves because they are simulated. 
The new population of a region Np at the end of a simulated time period 


Aeneciaie tos the eld; Ap, ~plusigain or loss of migrants yunj 


2. Potential Number of Migrants 

Assuming that a standard or adjusted population for each region 
in each time period can be derived, what proportion mj of that total 
can = expected to move internally (including movement within that 
region)? Obviously, such a figure has theoretical implications and 
should reflect the relative attractiveness of region j in relation to 
all other regions i , taking into consideration factors of employment 
opportunities, economic characteristics, age and sex structure, and so 
forth (Morrill 1965b, p. 97). The implications of such a figure are 


noted but in this study the derivation of mj's by substantial 
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43 
theoretical reasoning was not possible because of the general paucity of 
relevant data. Instead, the percentage of migrants moving internally from 


any region was estimated from the sample. 


3. Probability of Interaction Between Regions 

a) Raw Probabilities 

Probabilities were based upon a Pareto distance-decay function. 
This function has generally performed well and gives added interpretation 
to the migration process through the exponent b . As the distance 
between centres of regions was considered, a p(d) form was used. 
Expression (2.6) gives the raw probability of contact between areas. 
Owing to the irregular size of regions, the function was employed directly 


rather than through the special probability grid. 


b) Smoothed Probabilities 

The raw probability values were converted to smoothed 
probabilities, P. , by summing all the probabilities in the migration 
Preto Ores CO UnIcy, cee, ae the raw probability of contact 


Pune asa Cractilom OL, UnLTY. 


c) Population Correction 
The third stage requires the influence of population size of 


regions to be incorporated. Thus, 


2, eS ee (2.9) 


where E,, is the adjusted probability of interaction between regions 
ay ae 
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44 
jis andi ia, P. is the smoothed probability, and He is the population 


OpMregiont® 15%, 


4. Random Number Selection 

The adjusted probability of movement between regions j and 
i Sisvconverted to a discrete tally of random numbers. This range of 
numbers is relative to the ranges of all probabilities in the migration 
field of j . The programme selects a random number rj which 


determines! the destination of the migrant mj \. 


2. barEeiers 

The rj value is either accepted or rejected depending on the 
presence or absence of a barrier to movement between regions j and i 
No evidence of directional bias was evident, so directional corrections 
did not have to be made. The actual process of settlement does show 
that different regions, at different times, have only sparse settlement. 
Piepart, this 1s due to unfavourable physical characteristics, but also 
eomtne progression of agricultural Settlement. In keeping with this, a 
certain number of barriers to movement have been incorporated, so that 


the simulation remains more faithful. 


The incorporation of barriers illustrates one modification to 
the symmetry of the distance-decay expression. Note that a barrier 
between regions j and i does not necessarily imply the reverse, a 


barrier between i andj. 


Gea ame 
Two aspects are involved with time. In any one time period, 


how many generations in the simulation are required if one generation 
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is equivalent to all possible migrants moving once? Using the sample, 
the number of generations in a simulation was made equal to the average 


number of moves made by actual migrants in that time period. 


The second aspect involves the stability and variation of 
eee ies through time. In each simulated time period, the raw 
and smoothed probabilities were considered constant for all generations. 
Since each new generation results in a new population distribution Np , 
population adjustment to the basic probabilities was made for every 


Succeeding generation. 


In practice, the time aspect could be ignored, since the average 


number of moves of migrants was less than two for all time periods. 


7. "Averaged" Solution 
To approach the omega distribution, the results of fifty 
simulation runs are averaged out. A sub-routine calculates the mean 


population of each region in each generation. 


The general layout of the programme is indicated by the flow 
Chart (Pigure 5)... Detailed input characteristics are considered 1n 


Chapter Four. The programme computes for each generation: 


a) the number of migrants from each region to every other region 
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46 
b) the number of migrants moving from each j source to all i 


destinations; 
n 
een (ert) 


c) the number of migrants moving to each i destination from 


all’ =} sources: 


yng (272) 


dee the net gain or loss Of migzants in a région “k 


n n 
qc y mjd Yomi (is) 

5 i 

n 


where gk is the migrant gain or loss, ) mji is the net gain, and 


5 J 
iS Mj eto the net: loss: 
i 


e) the new population distribution at the end of a generation after 


all migrants have been redistributed 


I+ 


Np = Ap gk 


where Np is the new population, Ap the old, and gk is the net gain 


or loss of migrants; 


peel 
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£) the "averaged" population of each region after a series of 


simulations. 


The results of the Monte Carlo model are given in Chapter Four. 


Evaluation and analysis are presented in Chapter Six..- 


CHAPTER THREE 


MARKOV CHAIN MODELS 


Not all stochastic models of migration, diffusion, and 

Similar processes are simulation approaches based on sampling techniques. 
A variety of analytic models has been proposed, such as the information 
flow model of migration (Berry and Schwind 1969), epidemiology (Brown 
1968a), and random and biased net constructs (Brown 1968a). Although 
these analytic models are applicable, determination of the parameters 

is often a basic problem. Brown (1968a, p. 75) described the process 

of finding parameters for his suggested distance-biased net model of 


diffusion aS a’series o£ hit-and-miss trials. 


The estimation of parameters often involves circular reasoning, 
in which the parameters of the theoretical distribution are derived 
from the observations which the theoretical model should generate. 
Such criticism has been levelled against gravity interaction models. 
Since the weights of the masses and the distance are derived directly 
from observations, it is difficult to provide such weights with theoretical 
significance. Conventional use of distance-decay functions in Monte 
Carlo methods is also not exempt from this criticism. Where such 
functions are employed, the parameters are derived from the set of 
data under consideration. The nature of observations directly 
influences these parameters. Hence, different exponent values in the 


: : | 
Pareto expression have been derived in various studies. | 


The weakness of many conventional analytic and simulation models 


has encouraged geographers to seek new methods. The possibility of 


we 


 Wiotwe Bad 
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applying Markov chain models to problems in geographic research has been 
tentatively explored (Olsson and Gale 1968), though applications are 
still relatively infrequent. In the context of interregional migration, 
Rogers (1966b) noted that the use of a Markov Chain model permitted the 
population process to be separated from the population in question which 
is undergoing the process. Subsequently, Compton (1969) nas usedva 
Similar model to describe the effect of a migration system upon a 
population structure, assuming fertility and mortality factors to be 
Neld constant. In Chis study, eaetintite Markov chain model is applied to 
interregional migration in the Peace River District!. 

A finite Markov chain is a stochastic process which moves 

through a finite number of states, and for which the 

probability of entering a certain state depends only on the 

last state occupied (Kemeny and Snell 1960, p. 207). 

A Merkoy chain Consists) Off nm discrete states of being?. 
The probability of some phenomena being in a state at time t (or 
Gfterst discrete teime intervals) 1s given by Sie = (s),78 
where aay Genotes tiesprocability Of being injstate 1 "at times t . 


Since, phenomena can be distributed only among the n discrete states 


at time t , then 


n 
Safe a (201) 


lImost of the material on Markov chains has been derived from 
the work of Kemeny. These include Kemeny and Snell (1960), Kemeny, Snell, 
and Thompson (1956), and Kemeny (1959); note also Bharucha-Reid (1960). 


2n state could be regarded as a class, a region, or any 
Suitable category. 
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The probability of moving from state i to state j within one 
discrete time period is denoted by Pio - these transition 
probabilities form a transition matrix, P. Bach row of PF. is a 
Propensity veCtor, since phenomena in state i at time t can move 
only to each of the remaining n- 1 states, or remain in state i 


by time t + 1.. Hence, 


n 
Woacte a asl (3.2) 


The matrix P , composed of transition probabilities, can be written 


as 
oe Sc 
1 J 
Pio. Pstie 4284 
pe | (323) 
! 
ie Seine | 
Wie Gossrulce Ss tabec are aS sandy Seet, ANG D4, Dp... and So forth are 
i 5 shot 15] 


the transition probabilities. 


The fundamental Markovian property is that a particular outcome, 
occurring in a certain state:at a certain time, is directly dependent 


upon the preceding outcome in the previous time period. Let x be 


the state that a phenomenon is in at time t , where x is a random 
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variable, whose chance of occurrence is denoted by a probability 
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measure’, The conditional probabilities can be defined (Hammersley and 


Pandecom> 1964, ». 113); 


P(x =S) |x =S Gas pO ee Oe 
E Codapivet Lon inte Lynda 
(374) 
= P(x =S. |x) =e ) 
ie -1 
ye aoe 
and 
iS Sra Se. =) 2 = = 
Py5 = P(S;>s,) (x, S.1%. 4 Ss.) (3.5) 


A characteristic of some types of Markov chains is the derivation 
Otaarstale Ofastatisticalsequijabrium: »Interpretation of) this equilibrium 
is valuable in geographic problems. The derivation of equilibrium 


SOlUETONSmWaslloe EGedtedminudetal ly ine ad alatenssection. 


Markov chains are not entirely exempt from the criticism of - 
circular reasoning. Compton (1969) estimated transition matrices from 
population data. Clark (1965) also derived matrices from the data under 
consideration. Even where matrices are developed through the use of 
distance~decay functions or gravity interaction expressions, there is 
still the problem of giving meaning to the parameters of these 


distributions. Consequently, the derivation of a transition matrix 


| 


32 random event has a chance of occurrence. Its probability is 
a measure of that chance, ranging from complete certainty that it will 
occur (unity), to complete certainty that it will not occur (zero). 
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remains an important issue in the creation of Markov chain models. 


Markov chain models do not easily permit the introduction of 
other significant variables. Olsson and Gale (1968) proposed the use 
of q-dimensional matrices, though this raises the question of how to 
define appropriate multiplication rules. However, Rogers (1966b) has 
suggested methods by which age and sex structure can be incorporated 


into matrix operation. 


Despite these limitations, many problems can be approached by 
a model, which explicitly states that a particular event is directly 
dependent upon the preceding event. Golledge (1967) suggested that 
the Markovian property is extremely relevant in dealing with the 
market-decision process. Such a process is characterized by a search, 
in which a learning sequence gradually produces a habitual response 
patterns Bach decision isiinfluenced by the preceding decision. “The 


habitual response can be regarded as an equilibrium situation. 


Consequently, use of Markov chain models is steadily 
increasing in the Social Sciences. Among economists, Hampton (1968) 
developed a transition matrix showing the probability of movement of 
industry between regions of New Zealand. Smith (1961) utilized the 
technique to consider interregional trade. Harris (1968) proposed the 
model as a means to deal with the question of urban growth. The model 
could be used to indicate direction and timing of city expansion. 
Among geographers, Brown (1963) has developed a Markovian model of 
diffusion while Marble (1964) derived equilibrium solutions of trip 
movement structure. Rogers (1966a) saw equilibrium distributions of 


population as targets or guides for government policy on interregional 
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DS 


migration. 


There is a need for further exploitation of the model. The 
gaps left by simulation techniques are considerable. Markov chain 
models may be one means of filling them. While this study specifically 
deals with migration, the implication is that the model could be 
extended to other forms of spatial behaviour or interaction. Before 
turning to the construction of a Markov chain model, some relevant aspects 


OLEneNeSsermodelis will ben mllustrated. 


Regular Markov Chains 


Regular chains are an important class of Markov chains. "A 
Markov chain is called a regular chain if some power of the transition 
matrix has only positive elements" (Kemeny 1959, p. 391). A regular 
chain is ergodic, though an ergodic chain need not be regular. "A 
Mazkoveciain is ‘Called an ergodic. chain if it is possible to go.from 


every state to every other state" (Kemeny 1959, p. 394). 


To observe a Markov chain process, the transition matrix and 
iiemstarting state are specified. The process moves through a set of 
states successively. If the states and the transition matrix can be 
properly defined, then the behaviour of the system through time can be 
computed (Kemeny and Snell 1960, p. 208). The possible sequence of 
movements between states, and the probability of being in any state after 
so many steps or discrete time periods can be derived. A simple 


example will clarify this description. 
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. ee 7 ) a) . Ass : = ' 
i if i S 
- . 
i yore ta 
{ 
- ad) 
=, - a ‘ 
: 
7 
- 


; e | a . — as 5 
7 edt leteu ait} Io ndidediolgxs tel 20) Span eee, 


siento Youlus” .oftessbhanas Sye bOupiniher ne 


+3 clam ac mated youde abid «Tide “as pasil ys ta HAST wart (Alt. OS. 
1 
aie — febem oft ads. et saivuc tZemt sot Nolte tw foot 


_ 


= / : > 
a 


excdsa .nolssexwodyi No. sweivbdad -lotseqe to sagt SS 


eiseyes annvols? sme ,lebom nindo voyxs! 2 30 nono aaaee sit oF 7 


- 
ebatesvijevlli sd Liew @iabed owas ie 


+’ 
\ x = 7 : eniegt) vonbas 
A" Atnialo vosupN 30 suai tnsdyoqmk te e456 adietls’ 262pee 
; : ’ 


nodzbennss 0) 2o r9¥og Soe Vi nicdd wipeet a boliep al aint 


iwsitpe: A. (LOE va: (Oe veromel) “eiastsls evidiaog Ginn Bai 

A" ~alipess 2d toa bobo misho*sibopie tse peow > tbopiie ai’ a 
= . 

modtl op oF aldiecoq 21 Jt Bi ajtans orhoprs he DORLES zi aigilouw 


«(D@E .q , CCL yusmed) “sista weit yxsve OS Smee 


bie 4£10s mobiiadasd 4fd, ,czuasciy nisdp voAreN 6 evrsede oF he ¥ 
; i) oat =. . 
Wo See 6. Tpworls ecovom sescc1s ott ,hoeliiveqa Sin eiase paitses@ oa c 


od tno Kitver nolsiensss sd Sos.cogete oa 21 phevee 

od fis emit tewour! setéye sid Jo xtusiveriod srl? netiz’- 
9% eoneupbs sidtuacg eft . (GOS .q coe! id one Bers ilies 
$n stats yne al galsd to yatiiduciise oils) ine angmee 4 


signie A bev ib, 2 Ase ehh vag Spies sso tnegh, 


| letvint a2 |. 22 bos 2 \aegese Oe 
s tL ae 


54 
(o) (Oo) a (oO) (0) 


Probability vector isn jp 7 (py Py ), where Py denotes the 
probability of phenomena being in Sy at this initial stage, and ae 
Cenotes tie probapility of beingrin S. =) Gee the transttion matrix be 
) 25) 5 
Sy r a Wes) 
P = 


Knowing the initial distribution of phenomena, what will the 
distribution be after one step or discrete time period? These 
probabilities can be estimated by a simple procedure of multiplication 
of conditional probabilities (Kemeny 1959, p. 390). The probability 

(1) 


Py peol being in Sy alter one step is 


Ce) ae) (o) 7 
Pj = Py 5B ar Po os dk b 


and the probability of being in S, after one step is 
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which satisfies the above equations. The multiplication of the matrix 
P , by a probability vector p , considerably reduces the calculations 
Pequ fooecQ cstimace the tprobabmiity of being in any state atter. n 


steps. The numerical example given in Appendix B will further illustrate 


these comments. 


Torcenealsoebe shown (Kemeny) 1959, 9. 402)), that given 


expression | (3.7), then 


pin) x yo ope (SES) 
and 
ae pela = bP obe war) (ne seactors) (3.9) 
Tf the vector yp! , Of initial probabilities is multiplied by the nth 
power of P , then the oa pM) is obtained, the components of which 


give the probability of being in state j after n steps. Thus, 
Dp. pecie. ijth “entry of matrix pe ,; States the probability of the 


process beim moms avesar) Mafter/s ne steps, ai at started in state a 


Matrix multiplication provides a convenient way of obtaining 


the desired probabilities (Appendix B). The probability of moving from 
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56 
state i to state j is always the same. The constant or stationary 


nature of transition probabilities is a feature of Markov chains. 


Equilibrium Solutions 


The attainment of a state of equilibrium is one of the most 


(n) 


important cnaracteristics Of regular Markov chains. The vector ) 

is obtained from the vector gan! by multiplying it by the matrix P. 
“1 : ; = ; ae, : 

The vector nit ) is obtained from yin a) is a similar fashion, and 


so on (Kemeny 1959, p. 402). The vectors obtained by this linear 


transformation can be indicated by 


4) <2 48) a ye nae reves |) ey) =e (3.10) 

) 1 i 2 

He sVeCcor rie TsesenLyOn tO! the VeCclor i! ' \ on to p! 
and so forth. It sometimes happens that there is a probability vector 


4 , which is sent by the transformation of P on to itself, that is 


a= ey (ia) 


This vector t is termed the fixed point of the transformation P. 
It follows that the powers p approach a matrix JT , each row of which 


is the same probability vector 4 (Kemeny 1959, p. 416). 


Suppose, there is a system composed of three states of being— 
Rain, Nice, Snow (Kemeny 1959, pp. 384-438). Assume that a discrete 
time period is equivalent to one day. The probability of moving from 
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one state to another, that is of the weather changing by days, is 
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That the powers P approach a matrix T can be seen since 


Here, are yeaenotes the, probability of the process being in.State: j 


etter eight Steps, sit) itestarted inastate. i. .~§ 1 hepprobability.of having 
a nice day eight days after rain is 0.200. This is nearly the same (to 
three decimal places) as the probability of having a nice day eight days 
after snow or nice. Obviously, this is close to the equilibrium 
solution. Such a solution can be determined by solving a system of 


simultaneous eguations. From (3.11), 


| % 4% % 
ibe 1 ae 
(t,t, ,t,) su heeds (t),t5/t,) 
A es 


then 


neaviv 


58 


t+ to + a 

il 2 ee 
at, + a, + ee a 
at, + “at, = t, 
vats + “at, + oo — t. 


The unique solution is ft = (0.4,0.2,0.4). The matrix T is 


The probability of the process ending up in state j is independent 
of the starting state. Irrespective of whether the process starts in 
R, N, or S, the equilibrium solution assumes that the probability of it 


Cndiog up eis hs always) O24", in’ Nis 90.2") *and™in®’ S is 0.47. 


How is this statistical equilibrium to be interpreted? The 
unique solution above implies that it will rain 40% of the time, 
snow 40% , and be nice 20% . In the case of interregional migration, 
such a steady-state will be reached when out and in migration from each 
region are balanced in such a way that the system remains stable. No 
region will increase or decrease in size relative to the others, once 
this state is reached. The steady-state reflects the ideal or 
theoretical distribution which should emerge over time. Markov chains 
are dependent upon time, though once the equilibrium state is achieved, 
they become time independent. The equilibrium solutions can be derived 


without reference to the initial distribution. 
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Them Colvective Process 


The individual process has been considered, in which the 
ae entry denotes the probability that an individual who started in 
(newertens state will be inethe “jth “state after mn steps. The 
complementary collective process is more relevant to the study of 
interregional migration (Kemeny and Snell 1960, p. 209). In the collective 
Process, stie “jth entry of p? represents the fraction of people who — 


started in the ith region and will be in the jth region after n 


intervals. 


Assume that a transition matrix has been derived showing the 
probability of movement between three regions (Rogers 1966a, p. 207), 


such that 
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The inttial distribution of population is given by p (oar ie’ TD: 


where “ of the total population are located in region one, and so on. 

In practice, it makes no difference if absolute figures are used instead 
(CO) dees (0) : 

GEet raccoons. Thus; =) me ( Lon 7. 10,9. 0) ewhere » contains the 


population of the regions. Using (3.7) the distribution of population 


after several intervals can be calculated. 
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Prone on) merheonectiminoriuin solution 1s <= (10.84514-34°10.84). “This 
distribution is arrived at regardless of the initial conditions and is, 
therefore, a steady state. It represents a balanced condition in which 
inputs and outputs from each region are stable. Over time, region one 
will come to contain 10584, region two 14.43, and region three 10.84. 
Once the state of equilibrium is reached, these amounts will not vary. 


If the vector # is multiplied by P , the same answer will result. 


Absorbing Markov Chains 


In the simple Markov chain all processes are reversible. 
This need not be so. Labour and capital turned into a road cannot be 
ESCreaced oy breaking up the? road (Harvey 1967, p. 580). din regular 
Markov eee the process can go from every state to every other state. 
Absorbing Markov chains contain at least one state from which it is impos- 
sible to leave once reached, and from every other state it is possible 
to go to an absorbing state, not necessarily in one step (Kemeny and 
Snell 1960, p. 208). An absorbing chain can be recognized by the unity 


values in the appropriate Das entries. 


Suppose, there are five regions between which migration can 
take place, except that one and five cannot be left once entered, but 
can be reached from all other régions. A matrix “P ‘can be written 


(Kemeny and Snell 1960, p. 45), 


tw m4 


ad 
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Conventionally, the matrix of an absorbing chain is written in the 


canonical form, 


r states s states 
r states I @) ] 
Ps ——_— al Gn} 
s states R Q 


in which exist r absorbing states and s non-absorbing states. 
The matrix is partitioned into four sub-matrices. I is a rx r identity 
Mocrix witieoness in the diagonal, UO is a r x's zero matrix., R , 
the s X r matrix, concerns the transition from transient to ergodic 
states, while 9 , the s x s matrix, concerns the process as long as 
it stays in the transient states. 
A transient state is an element of a transient set ... 
A transient set of states is a set in which every state can 
be reached from every other state and which can be left ... 
An ergodic state is an element of an ergodic set ... An 
ergodic set of states is a set in which every state can be 


reached from every other state, and which can not be left 
once it is entered (Kemeny and Snell 1960, p. 208). 


The above example is written as 
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3 4 

IL 0) O 

5 O O 
P=2 lO 
3 owe 

4 ee ae 


From this can be derived N , the fundamental matrix for a given 

absorbing chain. The fundamental matrix provides considerable information 
about the absorbing chain. It can be used to state the probability that 
the process will end up in a given absorbing state, the average length 

of time for the process to reach an absorbing state, and the average 
number of times that the process will be in each non-absorbing state 


(Kemeny and Snell 1960, p. 404). Here, it is merely stated that 


Ss Ge 


; =i 2 : 
where N is the fundamental matrix, and (I-Q) is the inverse of 


the matrix (I-Q). From above, 
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This section has been included to familiarize the reader with 
the terms and technicalities of an absorbing chain. In any realistic 
assessment of an interregional migration system, where natural growth 
and external movements are considered, an absorbing chain seems more 
appropriate than a regular one. Use of an absorbing chain poses 


certain difficulties to the construction of a model of migration. 


Construction of the Markov Chain Model 


The aim is to create a model, which will develop an equilibrium 
population distribution for the twenty-six regions in the study area in 
each time period. How is the affect of population and external movements 
on the population structure to be incorporated in the model? What methods 


can be used to derive the transition probabilities? 


i Population Growtn 

Only if population remains stable can a regular Markov chain 
model be used. This involves the patently unrealistic assumption that 
birth, death, and meee rates balance perfectly. If births and 
deaths are included the chain becomes absorbing, since the former can 
Hout bewenterea fL£romeany Other State, nor can the latter be™ierty “If 
death is included as a state of being, conventional techniques imply 
that, in the equilibrium Ration! everyone will end up dead. Rogers 
(1966a,pp. 207-209) has proposed a way out of this dilemma. He 
suggested that a transition matrix P be developed with death as the 
nth state. The chain would be absorbing. Define a vector of births 
\emtoreeachn seglon. Wiiittidl distribution o£ 
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(ea Meo tete 1S ignored, except [or its impact upon the transition 


probabilities. Remembering the canonical form of P, (3.12), and 


(0) 


given W end |? ye then 

en ei) (3.14) 
where Q is the set of transient states. This gives a new population, 
=(o) ; (o) ; 
W , which will be smaller than WwW , Since some people will have died. 


MrmeenesveccOG, ox ,» 1Ssuadaed: to m9 (2) pene distribution of Tae 


population, including births, at the end of one time period os obtained, 


we 


Ge 2 Ge) ae Sle se (3.15) 

wy) os (w!?) 94x) 0 + x (3 6 ILS)} 
and 

wD) 2 wr? oh i NO ere ay (3-17) 


n (o) ,n 
HoweverpjesJe ~80 tas mf increases; ,and \W -Q >-0 also as n 


(n) n-l 


increases, which leaves W > x(1+0+...40 ~). As n increases, 
Tee et Ou 23 (i203 , and the limiting or equilibrium distribution 
is given by 

(n) -1 


Y= limw' = x(1-Q) | (3.18) 


A numerical example will illustrate the procedure (Rogers 1966a, p. 208) 


Consider a system of three regions, A, B, C, and a state Of death, D, 
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A B C D 
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Consequently, in the equilibrium distribution, regions A,B, and’ C 


wWEll contain 9, LO, and 7, respectively. 


Me vabvove jrocedure in modified form is followed in this study. 


2) -Extérnal Migration 
Roger's model considered only local migration and natural 
growth. No account is taken of movements into and out of the entire | 
area. A reasonable approach would adjust the X vector to include external 


migration as well as births, while the nth . state, D, could incorporate 


a0 
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death rates’. 


aoe lve transition Matrix 

Conceivably, a matrix could be derived by reference to the 
movement of sample individuals. This approach was rejected on two 
grounds. Inspection of the results indicated that the pattern of 
movement in the Peace is such, that not all regions sent migrants to 
every other. Consequently, many cells in the 26 X 26 matrix were 
left blank. A transition matrix should be ideally based on some theoret- 
ical criterion. Accordingly, a simple gravity interaction model was 


used to derive the transition probabilities, 


2 n 7 
I. SG ey) ) oer (Sse RS) 
: D,. hel D, 
Ss] ih 
where Be is the interaction between regions i and j, me is 


the distance between i and j , ae as tthe population of ya, 2 
is the population of region h where h = (1,2,...,n). Dea is the 
Gistance between h and i. , and b and g are empirically derived 


constants. 


Obviously, the parameters of the gravity model are open to the 
criticism of circular reasoning, but at least some effort is made to 
provide the transition matrix with theoretical significance. The 


distance varvable in (3.19) is straight-line distance between the 


“this vector X almost always contained positive elements. 
Where it did not, the X value was considered zero, and the net 
population loss was added into the déathn rate. 
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the centres of regions °. The masses were taken as the population of 
regions weighted by area®, In view of the controversy over which exponent 
b values should be used (Olsson 1965b, pp. 57-63), the values used in 

the model are derived from the Pareto expressions of the Monte Carlo 


model mul heaconstant wag suwase simi lagly dexived, 


For each row, a value from the gravity model was obtained 
for each Pes . There are two exceptions. The D value was obtained 
from death rates, while the Pa; values were estimated from sample 


data. The cell values for each row were then normalized. 


ae Stability of Transition Probabilities 

Transition probabilities in the Markov chain process are 
stationary. They do not vary within each time period. Feedback and 
changing inputs will lead to a disturbance of such probabilities. 
Accordingly, a different transition matrix was derived for each of the 


four time periods. 


The results of the Markov chain model for each time period 


are set out in Chapter Four. Detailed analysis is given in Chapter six. 


Sother conceivable measures could have been used, but 
apparently offer little more advantage (Olsson slGSD eps 65 i). 


6In effect, the population per square kilometer was used. 
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CHAPTER FOUR 


EXPERIMENT 


An experiment brings the models and observations together in 
this chapter. Discussion of the real world situation in the Peace is 
followed by presentations of detailed inputs to the models, and of the 
results of the experiment. Evaluation of the models and the experiments 


is reserved for the following chapter. 


The Peace River Study Area 


1. General Background 

The broadest definition of the Peace District covers an 
immense area in the Canadian north-west. Extending west to east from 
Hudson Hope in British Columbia to Lesser Slave Lake in Alberta, the 
region stretches north from the Wapiti River to Fort Vermillion which is 
close to the great Athabasca Lake (Kitto 1930, p. 1). The study area, 
for practical reasons, is much smaller (47,350 sq. kms.), and embodies 


the core of the Peace District (Figure 1). 


Basically, the Peace District, a plateau with a prevailing 
elevation of 2000' to 3000', is composed of glacial, fluvial, and 
lacustrine deposits laid ago on the horizontally bedded limestones of 
the Albertan Syncline. Relief differences (Figure 6G) are due either to 
the deep incision of rivers or to prominent uplands, which are remnant 


Pile olatoseys00' eto, 1200’ above the™plateau (Gite Vohiies SEIS Oke eile 
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FIGURE 6 
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Despite a northern continental location between latitudes 55° N 
and 57° N, the Peace region is well suited to the growing of cool-season 
crops) (Carder 1965, p..5)% *While late spring and early fall. frosts can 
cause crop damage, and summer droughts occasionally reduce yields, the 
area has a number of distinct advantages. Precipitation values, 
although low (yearly mean 17.54" including snowfall), os largest in 
July (mean 2.44") and are well distributed throughout the rest of the 
growing season. Moderate temperatures during summer (July mean 60°F) 
help keep evapo-transpiration low, and are adequate for growth. The 


‘6 


long summer days overcome the disadvantage of the short growing season! 


Killing frosts were once believed to be a major deterrent to 
agriculture, but agricultural expansion seems to have been conditioned by 
soil rather than frost (Tracie 1967, p. 7). Parkland areas have been the 
most favoured areas for settlement. Such areas are stretches of aspen 
parkland located on relatively level terrain, and developed on better soils 
such as degraded Black Earths (Tracie 1967, p. 8). The Grande Prairie 
(Buffalo Plains) is the largest of the parkland areas. Others are near 
Spirit) River, Pouce) Coupe, and to the north of the Peace River (Figures 7 


and 8). 


Prior to the early part of the 20th century,European interest 


in the area was confined to an almost legendary group of traders, 


missionaries and prospectors*. The isolation of the Peace prevented 


lone temperature and rainfall figures are derived from Carder's 
Table 1 (1965, p. 6). They are based on data collected for the years 
1916 to 1960 at Beaverlodge. 


2~ number of historical texts are available. The most readable 
of these is MacGregor's work (1952). 
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any wide spread of settlement. This isolation was doomed in the great 
westward movement of people after the 1870's, when the first trans- 
continental railway was completed. A railway did not reach the town 
OF Peace (iversuntil’ lOle7ebut its impact was felt long before that 
(Willis 1966, p. 19). The demand for wheat from industrialised North- 
West Europe was high, and railroads provided the means by which people 
could be brought in and the wheat taken out. As more and more land was 
taken up by settlers, new areas were sought. Various reports had been 
circulating about the agricultural potential of the Peace. These 
rumours were confirmed by a number of scientific travellers (MacGregor 


1952) eeo)e 


From 1905 onwards, settlers began arriving without waiting for 
the railroad. The first were ranchers, but these were soon superseded 
by grain farmers. Clusters of settlement developed, including Grande 
Pramriée and Spirit River. = Communication were amproved, first by the 
Edson Trail, which precipitated the area's first land boom in 1911, and 
then by the railway which reached Grande Prairie in 1916. Although, the 
effects of the railway were overshadowed by World War I, access was 
provided to a stable market, and settlers could now come in more easily 


“(Willis 1966, p. 19). 


The population of the study area (Table 4.1) has increased 
steadilytirom L921 (le@,050)F to 1961 (59,256). There were fluctuations in 
population after both World Wars, due to liberalised homesteading schemes 
for war veterans. The drought years of the late 1920's in Alberta and 
Saskatchewan encouraged many settlers to try their luck in The Peace. 


In contrast, there were out movements during the war years, and many 
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TABLE 4.1 - AREA AND POPULATION CHARACTERISTIC OF STUDY REGIONS, 1921-1961 


Region Centre Area Population 


Latitude Longitude (In square 
(degrees and minutes) kilometres) OAL WAL Weal WSysi, Ise 


i! iene 27 54 59 1669 0 24 28 28 128 
2 slap mene 3 55 07 3339 148 A 75m SAGwel 2 17 692522 
3 Tie) Yew) 54 59 1041 2 25 93 55 61 
4 PTOmeeOn 55 15 Tao 104 414 730 851 1074 
5 ioe23 55 22 3297- 1322 2823953159 —A0ct) 4636 
6 Tee 55 30 835 4) BOs GOL GI SNeod2 
7 Wiley 55 29 1156 0 2A ia 908) 127 
8 We WE 55 46 835 Pe 141 2040" 202) 385 
9 ny) 0 55 46 1959 1156 3309 3874 4954 4825 
10 Tio 20 ToT) 1850 0 10 0 10 0 
11 116 59 56 02 1706 251 1106 1249) 1662) 1994 
1 116 56 56 18 1096 1233 Srl 341s 456) 1959706 
13 11844 55 15 2234 4030 7133 7268 8229 13549 
14 ie) 55 03 1420 S520 20268 2112 16145 eat 
15 Lig VaR 55 20 1489 LA SEP Pe Re Se 
16 118 40 55 32 1357 35M 1385-91798 “S77 “1862 
17 Liseeee 2 55 49 1600 430 710 1050 1609 1766 
18 iio 32 55 46 1360 1260 2413 2650 3233 3585 
19 ey ey 55 45 3838 290 1688 2046 2120 1773 
20 Wiley Qe 56 O7 1700 0 32 38 99 317 
o% Sag? as 56 08 897 Mice) ep eye. Dasa’ soup 
ao) fise= 19 5G) § 02 1004 1456 ~~ 252492758 "2960" 3423 
23 hae Hee 2474 46 1525 2069 2300 2273 
24 iniep 26) 56 30 3529 359 1562 1844 1858 1570 
25 118 38 56 38 1669 0 24 80 97 84 
26 119%6 35 56 29 3226 0 Sie <3578 21S: 1736 
Total | 47350 16050 37072 42539 48478 59254 


Source: Canada, Dominion Bureau of Statistics 
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ifs 
people left after the bad harvests of the early 1920's (Tracie 1967, 


pHop23)t. 


In recent times, the Peace has become more integrated into the 
Western Canadian economy. Partly, this stems from the exploitation of 
natural oil and gas, partly, from improved communications (Vanderhill 
1963). The construction of the Alaskan (1942) and MacKenzie (1948) 
Highways and the Great Slave Lake Railway have linked the region to 
resource developments in the North-West. In 1955 a direct paved highway 


to Edmonton via Valleyview was completed. 


Homesteading policies have encouraged the development of the 
area with varying degrees of success. Renewed interest in the late 
1950's and early 1960's led to the opening up of new areas for agriculture, 
including south of the Wapiti River. Yet, while occupied land has 
sae ae the number of farms has declined. This seems to be more the 
result of the enlargement and consolidation of land holdings, rather than 


themactivities of new settlers’ Vanderhill 1963, ps >38). 


Cele cetulenent Process 

The primaLy aim of this thesis is to investigatesthe relative 
efficiency of the Monte Carlo and Markov chain models, not to examine the 
process of agricultural settlement. However, some idea of the gradual 
spread of settlement and transportation in the Peace can be gained by 
examining the series of maps from Figure 10 to Prguresl3.08 Tic most 
significant work in the Peace on agricultural settlement has bean carried 
out by Tracie (1970). He suggests that vegetation cover appears to be 


the most probable determinant of farm location im the period L908: to 19275 
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Apparently, homesteaders were most attracted by the lighter aspen 
poplar parkland areas, where there was at least 20% grassland. From 
1928 to 1968, the proximity of the homestead to settlement centres and 


major transportation links becomes more important. 


The steady gain in population of the study area seems to have 
been caused by the natural excess of births over deaths, with the 
exception of the period 1921 jto 1931, when there was a net.migxation gain. 
In the other three decades, estimates indicate a net migration loss 


occurred, in which out-migration exceeded in-migration (Table 4.2). 


TABLE 4.2 - POPULATION CHANGE BY TIME PERIOD FOR STUDY AREA 


Migration Gain 


Population Increase Waturak inerease SORT EGS 

Absolute =o Absolute %* Absolute 2% 
2 he Urs 1 20,022 130298 Bi fou, 23 441 cies hey es sy/ 
$931-1941 5,467 127 5e 7,486 PAO} AES) pe PA ONS) Syst 4S) 
92 = 195 Spee) Be IG 10,564 24.83 =A Ogee aL. O07. 


Pot-LIGr 10,776 22623 17,941 SEOs Sh, LOD aga a 3 


* 
The percentage figure is obtained by expressing the absolute gain or loss 


for the time period as a percentage of the population at the beginning of 
the time period. 


Most regions of oe study area have experienced steady growth 
Overetiecmper lod 192tcoll9ol (Figures 14, 167, 18, 20). The exceptions 
appear to be the less densely settled areas on the margins of tthe study 
area. Appendix C provides a full description of the ee ee changes 


in each of the twenty-six regions. The location of these regions are 
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Given in Figure 9. 


3. Relevant Literature 


There is a variety of standard historical works on the history 
Of the Peace River country (MacGregor 1952, Kitto 1930, Wellman 1965). 
The most significant work in respect to this study has been carried 
out by geographers. Tracie (1967) defined some of the factors involved 
in the migration of people into the Southern Peace’. These included 


favourable land costs, physical advantages, social contacts, improved 


communications, and Government Homesteading policies. 


Marriott's (1969) study of migration within the County of 
Grande Prairie for the years 1956 to 1967 attempted to suggest some of 
the significant factors determining local migration. Multivariate 
analysis in his study was unable to suggest any significant relationship 
between migration distance and independent variables, other than the size 
of the population centre. Such a relationship between distance and 
population size is summed up by both the gravity model and the Pareto 


functions used in the Markov chain and Monte Carlo models. 


Inputs to the Monte Carlo Model 


The basic requirements of the Monte Carlo model are population 
and area characteristics of the twenty-six regions, the number of 
potential migrants from each region, and a statement of the expected 


| 


interaction between regions. 


3fhe Southern Peace is that district bounded by the British 
Columbian border, the Wapiti, Smoky and Peace Rivers. 
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Area characteristics are provided in Table 4.1. Distance 
between regions is calculated by a routine which measures the interval 
Between the centres of regionms,,;where the location of each centre is 
expressed in geographic co-ordinates. Adjusted population, Ap values 
(Table 4.3), are eae 2 by expression (2.8). Natural increase and 
external migration figures required in the calculation of the adjusted 


populations of regions are presented in Appendix C. 


'.The number of potential migrants, mj, from each region is 
estimated as a percentage of the adjusted population (Table 4.3). The 
percentage values for each time period are calculated empirically from 
sample data. A similar procedure was followed by Morrill in Sweden 
(lI G5o, 7078155) in time period one; 1921-1931, -a value of 15.402 “is 
used. This implies that during the decade 15.40% of the adjusted popu- 
lation for each region are expected to move internally. Figures for the 
mWertode 1031-19041, 1941-1951, and 1951-1961 are respectively 19.50%, 
252903, and 24.00%. )m@he amount of, local migration generally increases 


over time. Morrill's values behave similarily. 


The basic interaction between regions is determined by means 
of Pareto functions, calculated by the method presented in Appendix A. 
These functions (Table 4.4) are utilised in the model through expression 
(2.6). Exponent values have frequently been used to represent 
graphically the extent of the Mean Information Field. Figure 4 indicates 
that the migration fields vary over time, but not widely. The 
exponents are larger than the b value of 1, WhLCh is OLtLen proposed 
for North American studies. They are closer to the results of Swedish 


studies, which indicated exponents varying from -2 to =—J55 ) bOr rural 
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TABLE 4.3 - ADJUSTED POPULATIONS AND POTENTIAL MIGRANTS OF STUDY REGIONS* 


Region Oe ea 1931-1941 


Adjusted Potential Adjusted “Potential 
Population Migrants Population Migrants 
1 24 4 28 ‘53 
2 556 86 620 ait 
3 26 4 107 21 
4 420 G5 555 108 
5 2S) 434 SVE: 610 
6 617 95 590 115 
i] 24 4 134 26 
8 155 24 dy], 3a 
) 3524 543 4025 785 
10 10 L O 6) 
jal 1328 205 1230 240 
EZ. 1395 Bis) 1241 242 
13 695 1075 1OSd 1548 
14 saeys: 354 1319 355 
IES) 3254 501 3609 704 
16 1407 AAT 2095 409 
U7 A 22 dl, 1084 244% 
18 2475 SiSib DENSE) Sow 
19 Ar STeAS) 22 2056 401 
20 a5) 5 4l 8 
ok. Z192 338 2319 452 
de 2063 318 2748 536 
23 1764 Rai 2 2150 419 
24 1397 BAS Lois 354 
Deed 24 4 92 18 
26 193 30 398 | 78 
dhfeg ora g S70 72 sy lies) 42539 8298 


*See Chapter Two, Section IV, for the definitions of Adjusted 
Population and Potential Migrants. 


Penoxea seis’ io mrokelnat vv 


at i een’ —— 


poset 
a ene 
Inking 3a" bs sab 
etrateim = ive tet 


<< — 


ist ofa 
£5. TOL 
sol ece 
Ola aS lt 
ei Oc 
as het 
if ver- 
cSt , @8O> 
0 F 0 . 
GL? ote 
CAS {pS! 
Shei LEEN 
ect Lt 
bOV. eOa 
POS En 
LIS . PROS 
Loa e 
[Oe a20S5 
‘8 it 
Ps eles 
dE2 ERTS 
eth O2ts 
bee EjSl 
Bi £¢@ + 
a: SPE 
; = : Reeth 
~~ — 


a 


Region 


Rey yee) SS eps) kp eS US) Re 


on et NM MeN N BR Pete ge BS ee 
CULMS Nee Oo, OO, SS ol on Se NR oO 


ee ee 


- Total 


TABLE 4.3 


194 


Adjusted 


eho el 


(Continued) 


Potential 


Population Migrants 


24 
1114 
suk 
816 
4200 
964 
66 
176 
4970 
1kS 
1649 
1839 
W927 
1108 
3265 
1355 
2550 
3007 
L233 
27, 
OMA, 
2964 
2365 
2276 
98 
Siete) 


43478 


590 
25 
93 


2599 
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TOS T= 96. 


Adjusted 


Potential 


Population Migrants 


114 
2267 


59254 
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TABLE 4.4 - EXPONENT AND INTERCEPT VALUES OF PARETO FUNCTIONS USED IN 
THE MONTE CARLO SIMULATION 


Exponent Intercept 
iS eae ewe eve Hee ei6) Dredd 
esi LOA | = 2.46 1.047 
RO AL—1957 eh 4.787 
LO bil L6G 1 eee O05 692 


a me Se Ne ee ee a re ne me a ne ee re ene 


Penulacrons | (Hagerscrand 1957) p. 113). "Olsson (19650, 2.60) suggested 
that most exponent values fall between -1.74 and -3.00, for all types 


om alahe(=warslonon Keyale 


Due to the lack of relevant information pertaining to the 
influence of social and economic factors through time, detailed inter- 
pretation of these exponent values is not possible here. Time period 
one (1921-1931) has the steepest migration field, -2.80, though there is 
little difference between it and time three (1941-1951), -2.77. In 
Contrast, speuiods two (7931 =-1941)" and four #(1951-1961) Nave slightly more 


extensive fields. 


In practice, the symmetrical inverse-distance relationship is 
modified by use of absolute barriers to movement (Table Tes These 
barriers were constructed empirically from sample evidence. They reflect 
the spread of settlement through time, and the presence of unfavourable 
physical conditions. For example, in time period one, 1921-1931, no 
region is permitted to send migrants to region 26, Since actual settlement 
during that decade was sparse. After 1931 settlement in this area 


expanded, and the barriers are accordingly removed. While the barriers 
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TABLE 4.5 - EFFECTIVE BARRIERS TO MIGRATION BETWEEN REGIONS 


Region to which 


: : : Regions from which barriers are effective 
barrier is effective ee 


1O21=1931 1931-1941 1941-1951 TWIST 1OGT 
uh All regions All regions All regions 
except 2 except 2 
and 5 and. 5 
3 All regions All regions All regions All regions 
OxXCepis a2 except (2 except 2 
and 4 and 4 and 4 
7 All regions All regions All regions 
except. 2,4, except 274, 
6 ‘and 919 6 and 9 
10 All regions All regions All regions All regions 
except 11 except ll 
abs) All regions 
excepe IS, 
Ge sands138 
20 Pil evegions. All #egions ~Allsreqions 
except 19, except 19, 
Dee seinel 2s) (Sy chsh 946 
Zo All regions All regions All regions All regions 
except, 23, except 923, except 923, 
24 Mand 26 "24" "and ~ 26 ~24 sande 26 
26 All regions 
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have little theoretical meaning, they impart a degree (ot Elem ba lary 


to the Monte Carlo probabilities: 


Monte Carlo Simulation 


The raw probability of contact, estimated by the Pareto expres- 
Sion, is converted to smoothed and adjusted values according to 
expression (2.9). An example of derived probabilities for one region is 
given in Table 4.6. The Monte Carlo programme computes internally a 
range of random numbers relative to the adjusted probability value. 
A random number is drawn for each pore ue al migrant in the region. The 
location of each migrant is memorised. In this particular example, no 
ee rene ere er counter ed. so no random numbers have to be regenerated. 


Ideally, the sequence of random numbers, {Rt, should have a serial 


correlation equal to zero, reflecting the degree of dependence between 


succeeding terms. The routine used to draw numbers has a serial correlation 


@& the order 1.5 x 1Ocee and is considered adequate for this study. 


Similar tables@to™ 436 “are produced’ for all regions in the 
simulation run. The programme presents the gain or loss of migrants for 
Mileregiens, 1m thestorm Of ja) 26°x 26° chart. Table 4.7 1s theyresult 
of one complete simulation run for the period 1921 to 1931. Clustering 
of the largest values about the diagonal indicates the greater importance 
of short distance moves, many of which do not cross the borders of the 
region of origin. Regions with the largest number of values in the columns 
receive the heaviest migration gains. Region 13, which Bact gcc 


Prairie, the largest population centre, clearly has a marked migration 


gain, as does region 18 in which Spirit River, an early growth, point for 


the area, is located. 
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The column totals provide the net migration gain, the row 
totals, the loss. Results of four single simulations through time are 
given in Table 4.8. Each single simulation was recorded, so that the 
average net gain or loss by internal migration could be calculated. In 
total, fifty simulations in each time period were averaged out. The 
means and standard SL Pee of the averaged distributions are listed 


ain GNaloey Zl eyo 


The final step involves the specification of the new population 


cistripution, using-expression (2).13)...9 The internal migration gain or 
Noss, wok, Lor each region, is added to the adjusted population, Ap > to 
give a new distribution, Np (Table 4.10). Averaged simulated population 


erSeri butions are presented in cartographic form in Figures 15, 177 12 
and 21, and can be visually compared with the actual population changes 
(Wigubes gide 6, 18 and 20), (Simulated results are’ tested againstPactual 
observations in Chapter Five, as are the equilibrum solutions achieved by 


the Markov chain model. 


Markov Chain Inputs and Outputs 


Equilibrium population distributions from the Markov chain 
model were determined by the procedure illustrated in Chapter Three. 
Transition matrices (Tables 4.12 to 4.15), indicating the’ probability 
of movement between regions i and 4 ., are Constructed by means joiva 
gravity model (Expression 3.19). The illustrated matrices show only 
n- 1 states. The nth state, which has been presented separately 
(Table 4.15), consists of the region's death rate (D value). In some 


eases the D value is. taken as the region's net population decrease, to 
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TABLE 4.16 - POPULATION DECREASE RATES USED IN THE MARKOV CHIAN MODEL ** 


Region De saves (Percentage) 
1921-1931t 1931-1941 1941-1951 1951-1961t 
1 0.00 0.00 140084 9.05 
2 oe 5.68 8.16" 9.05 
3 0.00 4.00 66.67 9.05 
4 9.38 6.76 6.85 9.05 
5 9.38 6.38 Fool 9.05 
6 9.38 A.57 ee 9.05 
i) 0.00 20.83 £2 10* 9.05 
8 9.38 7.09 1372" 9.05 
9 9.38 7.40 8.34 9.05 
10 0.00 65.10% 0.00 90.00* 
ll 9.38 5.97 7.85 9.05 
12 9.38 10.07 9.70 9.05 
13 9.38 6.31 8.15 9.05 
14 9.38 One Ve 47253" 9.05 
15 9.38 8.25 8.44 9.05 
16 9.38 5705 DAT A2s 9.05 
17 9.38 5.92 Tie 52 9.05 
18 9.38 8.54 8.53 9.05 
19 9.38 ch 6.60 40.70" 
20 0.00 0.00 7.89 9.05 
ohh 9.38 Tae acs 9.05 
22 9.38 7.29 8.30 9.05 
23 9.38 5.90 6.48 AB 80> 
24 9.38 4.35 7.38 9.05 
25 0.00 4.17 5.00 9.05 
26 0.00 1.99 6.72 9.05 


a 


x : ‘ 
The values are equivalent to the percentage population decrease 
over the period. The corresponding xX values have been taken as zero. 


tBirth and Death rates are not known by regions in these decades. 
The average rate for the study area has been applied to the population 
of individual regions. 

tthe percentage figure is obtained by expressing the absolute 


decrease as a percentage of the population at the beginning of the time 
period. 
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TABLE 4.17 - POPULATION INCREASE VALUES USED IN THE MARKOV CHAIN MODEL 


Region 1921-1931 1931-1941 1941-1951 1951-1961 
1 Pe 4 0% 88 
2 A21 We B37 1160 
3 24 83 750 12 
4 325 169 136 391 
5 1619 485 1291 519 
6 547 110 413 190 
7 24 115 o* 45 
8 89 26 Om 183 
9 2476 961 1419 1010 

10 10 ce 1} On 
1 1100 190 498 403 
se) oy) 35 527 1598 
i 3324 1250 1251 5706 
14 1525 OF o* 120 
15 2102 327 536 1200 
16 1086 780 Ow 166 
“ly 332 416 1385 356 
18 ss 362 583 1045 
19 1116 435 88 Om 
20 33 9 92 196 
21 980 357 531 910 
oy 743 408 435 1096 
BS 22 5 430 Ds 
24 1071 319 568 34 
25 24 69 22 2 2 
26 193 250 25 191 


x 
A zero value indicates that the region has a net population 


decrease. The. decrease has been added into the D. value of the trans- 
alien oyausitshen eum el=tse 
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prevent the occurrence of negative values in the xX vector (Table 4.16). 
The xX vectors are normally calculated by adding each region's expected 


external migration gain or loss to the natural increase. 


The inverse of the transition matrix (N, the fundamental 
matrix) multiplied by the x vector gives the expected equilibrium 
distribution of population (Table 4.17). These equilibrium distributions 
are tested against real observations for each time period in Chapter 


Five. 
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CHAPTER FIVE 


TESTING OF THE MODELS 


A theory is necessarily based on more or less true approximating 
ascumpttons. The derivation of assumptions is technically referred to as 
[Re Pp roolemolvspeci fication. (Bartlect 1962, po. 24). In this chapter the 
specification or statistical model is tested against the statistical data. 
The theoretical distribution is related to the observed facts to assist 


‘in the interpretation of the facts and to facilitate further inference. 


What are the theoretical distributions? In the Markov chain 
model, they are the equilibrium solutions derived for each time period. 
In the Monte Carlo model, the simulation results are assumed to be 
representative of an underlying theoretical distribution. These theoretical 
distributions have to be related to actual observations. If it is assumed 
that the real world has only a probability existence, then the observed 
facts can be considered as one possible sample drawn from many. 
Consequently, an adequate one-sample test is desired, which gauges the 
goodness-of-fit of the sample to the theoretical population (Siegel 
1956602055) wesc tests indicate wiether a particulary sample (here, 
real-world observations) could have come from some specified theoretical 


population (the simulation and equilibrium solutions). 


A common technique to test the difference between the observed 
(sample) statistics and the expected (population) parameters involves the 


use of A "t" ‘test. “Such a test requires the assumption that the sample 


calomel as terete 


a 


sb 


ae: ig hae. 

griorieegyh outst atel so esomne "hens d Vilyesagen ages ia 

= 4 = + des ~ 

ah of barr: os. thlaoladsus cet Snotigmees 10 nok tov esas ai? a0 
: 7 a : ae ee 

ei xpigeh> elda nt i (hS .q “Rael sdaftese) nodteoditosge Ge 


t4eb Lectdelocin eft tadlops Based) ei fehon Taurselsesa tw noiys= 
- : - ot - = ; 7 = 
deteen of eins] bavxsedo en of bhjeiss a) nolsveiseals iso 


a _ 
sohnewaini ssitul sundiffrost o3 Bans cine? 279 Zo nolsarongisdat pad; 
r : ‘ : : i 
« - h 


‘ihc Vous -adt RI Tanottucixvte Pte Shtsosty ody tk tate | 


® 
S 
.boiteq anit dosy xo} fe vETSb snoiylea mir) 1hiitaapes ‘ol? ote em) 

ad of Homeces. 644 ,cslveex (cote limic ott, bebam’ohsad tat 9 


faobisaceil! sesii1 Notdudivseth Lleottjeroes paeyvisshinest 30 ovidi 


Bominss e2t 3) 23 -anottevasedoe Isises of Satalexy sd ed aved emease 


> & 


a - Te 
sit , 900s) aixe ol Jedoia « Vino asf bisow Igeas. ody 


ia “ — 


-yoom mo nvexh otemse Sldfaeeg spo 25 Borehteies ed Gao 


; Fs — 
bavrsedo oft 4 7 1 
> a 
: 
act: 


sit ssposp dotiiw ,betivobh et teed sfumarn—onm eseipebe te. | i 
tepaiz) Aoiseluroy lsoidsxonid alt ot slanpe any to 25% 
eSterl) olnman seluotitad s raiitestey ogi ind esaus Mou ee + 

feoitezosiis heryioege omGe mut NID Sveri bide ae 


sfenoisjulos misdilinos Bre dobaek iets ‘ald. | 


7 ‘ ; ‘ 7 Le 
bayisedo sii nowted e>newetttbh odd fad 93 syptadssd ame 


sid seyioval Sasson THK" aa ‘badaeqne beta See ” 


Napee ot Jari) cAStiéavueak dn eeahapes, Fase gious aie 


~ A 
t% 


ae 


130 


observations come from a normally distributed population. (Siegel 
2250, 1D. 22). ©ilnee this asstmption of normality can not be made here, 
the one-sample test used in this study must be nonparametric rather than 


one of the generally more powerful parametric varieties. 


Chi-Square and Kolmogorov-Smirnov tests are frequently used 
as one-Sample tests. Siegel (1956, p. 59) considered that the Kolmogorov- 
Smirnov test is a more powerful goodness-of-fit test than Chi-Square. 
The latter can not cope with small class values, so that class intervals 
have to be altered, resulting in loss of information. Small values, or 
even zero entries, do not affect the performance of the Kolmogorov-Smirnov 
test. Consequently, the latter technique has been used in this study to 


estimate the goodness-of-fit between real-world and theoretical distributions. 


The Kolmogorov-Smirnov test involves the specification of theor- 
etical and sample cumulative frequences. The degree of agreement between 
the expected and observed values is determined by the maximum deviation 


between the two, 
D = maximum |F CS) Ss (x) | (SL) 
fo) n 


where D is the maximum difference, BOX) the theoretical cumulative 
Gliisierewwaskeieia,, hare! S6%) the observed cumulative distribution. The null 
hypothesis, Ho! is that difference between the two distributions are 

not signifmeant but due merely to chance. The smaller.the value of D, 
the more likely that the null hypothesis can not be rejected. Since the 
sampling distribution of D under Hy is known (Siegel 1956, Table E, 


p. 251), comparison of the D value with critical values in the sampling 
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distribution permits acceptance or rejection of the null hypothesis. A 
full description of the Kolmogorov-Smirnov technique is provided by 

Siegel (1958, p. 47-52). The test should only be used when the variable 
to be examined has a continuous distribution. However, migration data are 
discrete. If the test is applied to discrete data, then the resultant 


error is on the "safe" side, since the test is a conservative one (Siegel 


1956, p. 59). 


In this study, the real-world observations are tested against 
theoretical and simulated distributions. Single simulation results are 
considered as samples drawn from averaged distributions to determine if 
one simulation run varies greatly from the averaged result. To facilitate 
testing of observed and expected distributions by the Kolmogorov-Smirnov 


technique, the data are presented in three ways: 


1. the absolute population values for each region are measured in 
nine size groups; 
2. percentage population values for each region are distributed 
among eight categories; 
3. the! nét population gain or loss of a region over a time period 
is expressed as a percentage of the population of the region at the beginning 


Gf a time; period. |The resultant values are arranged.in nine classes. 


In all three cases, the number of classes used for measurement 
are as numerous as possible, in order to take full advantage of the 


sensitivity of the Kolmogorov-Smirnov technique. (Tables 5.1, 5.2, Delis 


To illustrate the technique, Table 5.4 shows the expected 


(equilibrium) and observed (real world) cumulative frequency distributions 
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for regions falling within certain percentage size classes. The maximum 


Deevealucein nis example “is 95/26 (0.1923). To cause rejection of 


TABLE 5.4 - EXPECTED AND OBSERVED CUMULATIVE FREQUENCY DISTRIBUTIONS: 


EQUILIBRIUM AND REAL WORLD PERCENTAGE DISTRIBUTIONS, 1921-1931 


Class Equilibrium nee F(X) §._ (xX) b=|F (x)-s (x) | 

(Percentage) Solution yoeia fe) 26 fe) 26 
Ca) 8 6 8/26 6/26 2/26 

Of2 5a 5 a E50 1 2 OY26 8/26 17 26 

pe 5Or=e 00 4 @) 13/26 8/26 5726 

1.00 = 2.00 2 4 15726 12/26 3/26 

2.00) —_ A. 00 5 is 20/26 15/26 5/26 

A OUR ...00 8 8 2126 23/26 2/26 

B00 L6.00 eS 2 24/26 207 26 L726 

162.003= 2 i 26/26 26/26 0 


the null hypothesis—there is no significant difference between the 
equilibrium (expected) and real world (observed) distributions other 
than chance variations—the D value must exceed the critical probability 


value chosen. 


The maximum D value is less than the p value associated with 
phemmo. 207 tleve ae(lablews 25) 1. elnsother words,iithe probability of any 
difference between expected and observed distributions being due to 
chance factors alone is greater than 20 chances in 100. The null 
hypothesis can not be rejected. The observed distribution of regions 


can be considered as a sample of the expected equilibrium solution. 
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TABLE 5.5 - CRITICAL VALUES OF D IN THE KOLMOGOROV-SMIRNOV ONE- 


SAMPLE TEST 


Degrees of Level of Significance D = maximum [E,G@-s_ (x) | 
Freedom cageee ae Sey eT RM es a ema a cake atm ee 
. 20 5 LO ~05 SOU 


26 OFZLO2 0.2240 e239 7 OF26/2 Ore 202 


Source: = *Adapted from*Siegel (1956, Table EB, -pi 251). 


Equilibrium Solutions and Real World Observations: 


Comparison of two distributions is very difficult when there 
are aS many as twenty-six observations (Table 4.18). Only the more 
obvious deviations can be picked out by a visual examination. For example, 
the Markov chain model predicts that region 22, not region 13, will 
contain the largest percentage of the total population of the study area 
an time period one (1921-1931). After the first time period, region 13 
is clearly the dominant region in the equilibrium soluitons, as it is in 
the real world observations. The Kolmogorov-Smirnov test compares the 
two cumulative frequency distributions on the basis of the maximum 

observed difference. While some information is lost, because of the 
necessity to group the regions into size classes, the test takes into 


account overall variations in the system of regions. 


Results of the tests (Table 5.6) indicate that there is a 
promising relationship between the theoretical equilibrium distributions 
and the real world observations. The null hypothesis states that there 


is no significant difference between the expected and observed 
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TABLE 5.6 —- MAXIMUM DIFFERENCE VALUES BETWEEN EQUILIBRIUM AND REAL WORLD 


CUMULATIVE FREQUENCY DISTRIBUTIONS 


Frequency Distribution 


Time No MGs ake Giions No. of Regions Nos or Begions 
‘ : by Percentage by Percentage 
Period by Population : 5 
Oprtotal Repulation Gain or Loss 
Lo2le1931. OF Le2 One 2 Oe23. 
1931-1941 ORES Ons On23. 
1941-1951 O.077 OO 7a7 O23. 
1951-1961 OF2Z31 ORL 2 Os 231 
distributions, and that any differences are due to chance factors. [In 


no instance can the null hypothesis be rejected at the conventional 
pepercalstiamics oF 90,05) ands O.0Ls Fave cut of the twelve D values 
indicate that the null hypothesis may be rejected at the comparatively 
weak confidence level of 0.15. A value equivalent to or less than 
themep value ore O50 suggests that any differences, occurring purely 


by chanee and not Signi ficantumay yariser LOSytimes an 100 cases. 


TABLE 5.7 - MAXIMUM DIFFERENCE VALUES BETWEEN SIMULATED AND REAL WORLD 


CUMULATIVE FREQUENCY DISTRIBUTIONS 


Frequency Distribution 


; N fa pyet No. of Regions No. of Regions 
are Rtas vr by Percentage by Percentage 
caaae ee aa of Total Population Gain or Loss 
Averaged Single Averaged Single” Averaged Single 
1921-1931 0.038 G2038 O.077 O07 7, 0.038 0.038 
1931-1941 0.038 0.038 @) ) One en liey 
toad i= Loo) Q5039 0.038 0.038 0.038 OeitS Oakis 


LoS l= LOG. 0.038 0.038 0.0338 0.038 0.154 Osis 
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Simulated Results and Real World Observations 


Pitts (1963) suggested that the results of a: series of simu- 
lations can be averaged to produce a combined or "omega" distribution. 
Since individual simulation runs vary widely, the averaged solution is 
assumed to have more theoretical significance than one single simulation. 
In this study, observations are tested against a randomly selected 
simulation in each time period, and against the average of fifty 
simulations. Kolmogorov-Smirnov tests indicate a high degree of 
association between real-world observations and both averaged and single 
simulation runs, (Table 5.7)2) in no instance en null hypothesis 
be rejected with any significant degree of confidence. All maximum 


Githitewencemyalhue sare less thon tie critical 2.0.20 confidence level. 


TABLE 5.8 - MAXIMUM DIFFERENCE VALUES BETWEEN 


AVERAGED AND SINGLE SIMULATIONS 


Frequency Distribution 


T4 N Capea No. of Regions No. of Regions 
eh ee aie, by Percentage by Percentage 
es eg Pao st of Total Population Gain or Loss 
Po boo 1 O O 0) 
193i-194L 0) 0) 0.038 
LIA ToS) 0.038 O 0.038 
O 0.038 


ie joy Fone ie Al O03 


There is very little difference between the results of the 
single simulations and the average of fifty runs (Tables 4.10 and 4.11). 


When both distributions were grouped into classes, the frequencies were 
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Vietyeclosce, sand (identical in «some instances .(Tables 5.1, 5.2, and 5,3),. 
Kolmogorov~Smirnov tests were applied to these frequencies to see if one 
single simulation varied significantly from the combination of runs. 
The single simulations (observations) were regarded as samples coming 
from the averaged (theoretical) solution. The resultant maximum 
difference values are extremely low (Table 5.8). Had different single 
Simulations been chosen, the results might have been quite different, 
as evidenced by the standard deviations of the combined simulation runs 
(Table 4.9). However, the single simulations were chosen completely 


at random, and must be considered typical. 


TABLE 5.9 —- MAXIMUM DIFFERENCE VALUES BETWEEN EQUILIBRIUM 


AND SIMULATED DISTRIBUTIONS 


Frequency Distribution 


. 5 No. of Regions No. of Regions 
Time No. of Regions 
Pod ie Sees pes by Percentage by Percentage 
- ee Of lOtal Population Gain or Loss 
Averaged Single Averaged Single Averaged Single 
Ned not UVC On g2 Ons Oame 2 Orlow OrZ34 Ors 1 
LO ZI=19AL O41 15 OnLES Opals O15 0.231 0.269 
1941-1951 OSELS OFLL5 Ores Oa Owe 0.154 
L751 — 196) On23t Onto 2 O2192 On 2 O22 33) Ono 2 


Equilibrium Solutions and Simulated Results 


The experimental sampling model (Monte Carlo) and the analytic 
Markov chain model were tested against each other to see if further 


differences could be revealed. In general, results suggest little 
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difference between the equilibrium (expected) and simulated (observed) 
distributions (Table 5.9). However, when the equilibrium solution is 
compared with the results of a single simulation in time period two 
(1931-1941), in one instance, the null hypothesis of only chance 
differences between expected and observed distributions can be rejected 
at the 0.05 confidence level. The corresponding equilibrium and 
averaged solutions have a much smaller maximum difference value, so 
that the hypothesis can only be rejected at the 0.15 confidence level. 
If calm.  OAr@s) confidence level is regarded as critical, then combined 
and single simulation results perform quite differently. This occurred 
only once in twelve possibilities at the 0.05 confidence level, but 
suggests that greater reliability can be given to combined rather than 


Single simulations. 


Summary 


Regional population figures have been organized into frequency 
distributions. Similar frequencies have been derived for the regional 
scores predicted by the combined and single simulations, and the Markovian 
equilibrium solutions. The expected (theoretical) and observed (real- 
world) frequencies are tested by a one-sample Kolmogorov-Smirnov test. 

The one-sample test permits the acceptance or rejection of the hypothesis, 
that there is no significant difference between observed and expected 
frequencies, other than chance variations. The real-world observations 
can never be rejected, at the 0.05 confidence level as coming, from the 
predicted, theoretical distributions. The Kolmogorov-Smirnov technique 
also indicates little difference between the results of single and 


combined simulations. 
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Analysis and evaluation of the Monte Carlo and Markovian models, 
based on the results of the Kolmogoroy-Smirnov test, follow in Chapter 


Sse. 


han acl 


CHAPTER SIX 


CONCLUSIONS 


The primary aim of the thesis is to investigate the relative 
efficiency of the Monte Carlo simulations and Markovian equilibrium solu- 
tions. Out of this has emerged the secondary task of evaluating the 
significance of an averaged simulation solution and a single simulation 
run. On the basis of the Kolmogorov-Smirnov tests presented in Chapter 
.-Five, the Monte Carlo and Markovian models successfully reproduce the 
features of the real world within reasonable bounds of confidence. 
Qualifications must be added, since the one-sample Kolmogorov-Smirnov 
test is less powerful than its parametric counterparts. Also, as 
regions' scores were grouped into classes, a certain amount of information 
was lost. Despite these limitations, the results clearly indicate that 
both the Monte Carlo and the Markovian techniques function reasonably 


well. 


The maximum difference values between Simulated (expected) and 
real world (observed) cumulative frequencies (Table 5.7) are generally 
less than the difference values obtained by comparison of the equilibrium 
solutions with the real world (Table 5.6). However, in the latter case, 
the null hypothesis—differences between observed and expected distributions 
abewnot significant, but due to chance—can never be rejected at the 


critical 0.05 and 0.01 contidence levels. 


The Kolmogorov-Smirnov test examines the total variation of the 
a7 stem (On ireglons.emci ice Lhe Lest)requimes the Specitication of 


cumulative frequencies, positive and negative scores of regions can not be 
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evaluated by this method. “The: tabulated data (Tables 4.10, 4.11, 4.18) 
and choropleth maps (Figures 14-21) indicate some of the more significant 
deviations of observed values from expected figures. Table 4.18 

presents the variation between equilibrium (expected) and real world 
(observed) values for regions, in terms of the percentage of the total 
population of the area in each region. In general, there is a reasonable 
fit, though the Markovian model in time period one (1921-1931) predicts 
Pveatereqion 22) will be dominant, not reqion 13, which contains Crande 
Prairie, the capital seat of the Peace District. The Markovian model 
continually under-predicts the percentage value of region 5. Deviations 
between the predicted simulated values and the real world observations 
are also apparent (Figures 14-21). For example, both the single run and 
combined simulations exaggerate the size of region 13 in time period two, 
1931-194) (Tables 4:10, 4.11). Such fluctuations are generally Jess than 


those observed between equilibrium and real world values. 


A cursory interpretation of the results would conclude that 
the simulation model performs slightly better than the Markovian model, 
since the overall variation between observed and expected values is less, 
ends tie, prediction of Lie scores of individual regions’ 15 closer to the 


observed figures. However, certain qualifications must be made. 


One reason for smaller deviations of simulated values from 
observations may lie in the method by which the constants of natural 
increase and external migration are introduced into the Monte Carlo model. 
Such constants impose rigid controls upon the results of the simulations. 
In the Markovian model, following derivation of xX as D vectors, 


which take natural increase and gain or loss due to external migration into 
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de has the whole population was subjected to the process of population 
redistribution. Calculation of the expected number of potential migrants 
was not necessary for the Markovian model, but was required by the Monte 
Carlo method. This is a powerful factor in favour of. the Markov chain 
model of migration, since the elements of population change—natural 
increase, and external and internal migration—occur together continuously 


in the model. 


Hence, the Markovian model is more realistic than the Monte 
Carlo method. Moreover, the operational procedure of the former is 
simpler. The number of calculations required by the Monte Carlo model 
is much greater than the Markovian techniques. The running and averaging 
in each time period of fifty simulations required many hours of computing 
time. In contrast, once the transition matrices and the X vectors were 
constructed, the Markovian equilibrium solutions were achieved by a simple 


procedure of matrix and vector multiplication. 


The author contends that the operation of the Markovian model 
of migration is more satisfying than the Monte Carlo technique. Admittedly, 
the Kolmogorov-Smirnov tests indicate a slightly Breet fit between 
observed and expected values when the Monte Carlo model is used. Yet, 
the results achieved by the Markovian model are still excellent, and in 
no instance can the null hypothesis, that both observed and expected 
distributions are similiar apart from random deviations, be rejected 
at the 0.05 confidence level. The implementation of the Markov chain 
model is both more realistic, since the elements of population change 


occur together continuously, and more flexible, as the computations 
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required are much less than those demanded by the Monte Carlo model. 


The excellent performance of the Markov chain migration model 
should not be allowed to obscure several drawbacks, especially if the 
technique is tested with more detailed information. Both the Markovian 
and Monte Carlo models in this study are general in nature. In both 
models a simple inverse relationship between migration intensity and 
distance is the basis on which a theoretical population distribution 
is generated. The under-prediction by the Markovian model of the size 
of region 13 in time period one, and the continual under-estimation 
of the size of region 5, indicate that factors other than the simple 
inverse-distance relationship need to be considered. The low predicted 
value for region 13 in time period one would suggest that the marginal 
location of this region is enough to off-set the high population density 
(Figure 9). In contrast, region 2 received an exaggerated value, 
because of its central location, and its moderately high population 
density. Again, the constant under-prediction of region 5 by the 
Markovian model seems to be a reflection of that region's location in 
the south-east extremity of the study area (Figure 9). Since the 
transition matrices, derived by means of a gravity model, incorporate 
both distances between, and size of, regions relative location becomes 
extremely important. Some additional functional weighting is desired in 
theecasenot, regionlelimibéecause Crandé*Praizie tis "a régional centre 
for the Peace District. However, the equilibrium solution only under- 
estimacedisinems2e idfUregionNels “inetime period*one "(1921-1931)" *In 
the other three decades, the size in population of region 13 was 


sufficient to overcome the disadvantages of the region's location. 


" 
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If independent variables other than distance were considered 
necessary in a model of migration, ‘then how are they to be incorporated? 
In Geography, analytic models have not been too successful because the 
derivation of joint probability functions for spatial variables poses many 
mathematical problems (King 1969, p. 230). The use of physical and 
functional barriers in simulation methods is a response to the absence of 
joint probability functions. Inthe Monte Carlo model barriers were used 
to modify the inverse-distance postulate. No similar procedure was 
possible in the Markov chain model. Additional factors could perhaps be 
incorporated in matrix operation as recommended by Rogers (1966b). He 
included age and sex structure in a model of migration. In theory, two- 
dimensional matrices could be expanded to three or even q-dimensions to 
accommodate many more than two variables (Olsson and Gale 1968). lowever, 
whether complex multi-dimensional Markovian models can be made operational 


remains to be seen. 


The Monte Carlo method has the powerful advantage of being able 
to incorporate chance deviations from the expected probabilities of 
migration. This is extremely important, owing to the disadvantages imposed 
by the probabilities. The gravity model used to estimate the Markovian 
transition matrices, and the Pareto functions employed by the Monte 
Carlo method, have similar disadvantages. Both incorporate only one 
independent variable (distance) and both require estimation of parameters 
from the observations to be predicted. Such parameters have Male eee 
theoretical significance. These drawbacks can be overcome somewhat by the 
Monte Carlo method since the process of allocating random numbers to 


migrations permits chance factors to modify the expected results. The 
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incorporation of the chance element overcomes, to a certain degree, the 
ignorance of the researcher because it reflects unforeseen factors. In 
the Markovian tha the probabilities of migration were basically similar 
to those utilized in the Monte Carlo model, but were not modified to 
include chance variations. Consequently, several significant deviations 
between predicted and observed values arose, such as the under- 


SSEtmacion Of the size Of region 5. 


As a general model of migration, the Markovian Pech is 
successful, but if the method were applied to situations where more 
detailed data are available, and if a greater level of precision were 
required, then the transition probabilities would need to include the 


effect of other significant physical, social, and economic variables. 


Markovian models are proposed as an analytic alternative to 
simulation methods, but Monte Carlo models can also be improved. The 
Monte Carlo model produced similar results for both single and combined 
Simulations. As a further check, Simulation distributions were compared 
tothe equilibrium solutions to reveal any unsuspected difterences 
(Table 5.9). In one case, the results of a single simulation were 
rejected at the 0.05 confidence lével as coming from the theoretical 
population. The averaged solution was not rejected. The author believes 
that an averaged result is more accurate than a single simulation in the 
case where discrete moving units are considered. High speed computers 
make the averaging of many simulation runs simple, although the task is 


still time consuming. 


Since the Markov chain model of migration used in this study 


Nob 


reasonably duplicates the performance of the Monte Carlo model, and 
conceptually is much neater and not so laborious to operate, Markovian 
models could be given wider application. In Geography, Markovian models 
have chiefly treated population movements (Rogers 1966a, Compton 1969) and 
diffusion (Brown 1963). Economists have used the technique to predict 

the interregional movement of industry (Hampton 1968) and tee (Smith 
1961). Conceivably, Markovian models could investigate a wide variety of 
forms of spatial interaction, including migration, diffusion, eee of 


settlement, consumer behaviour, and changes in urban morphology. 


The technique need not be applied solely to the prediction of 


future spatial distributions. The models in this study were applied to 


a wide historical time span. The continuity of results through time 
suggests that the Markovian model could be used as an historical - 
predictive device in a detailed study. Like the Monte Carlo method, the 


model could test hypotheses about spatial behaviour through time. 


If Markovian models are to be given wider application, then 
the transition matrices must be meaningful, and capable of incorporating 
the influence of several significant variables. Unless these conditions 
can be fulfilled, the conceptual elegance of Markovian models will be 
void and Monte Carlo techniques will continue to function as a reasonable 


alternative to oversimplfied analytic models. 


a2 
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APPENDIX A 


CALCULATION OF PARETO FUNCTIONS 


A number of studies has » shown the existence of a significant 
inverse relationship between migration intensity and distance 
(Hagerstrand 1957, Morrill 1963a, and Olsson 1965a). This relationship 
Was identified for migration in the Peace District 1921 to 1961, and 


expressed in mathematical form in the Monte Carlo model. 


The first step was the definition of the two variables, 
migration intensity and distance. Imagine a chart with a Dene oe Vere Kenmeln 
around which a series of rings are drawn at ten kilometres apart. The 
distance variables were taken as the distance (kilometres) between the 
point of origin and the mid-point of each ring. The number of migrations 
ending in each ring was obtained to provide the migration intensity 
variables, expressed as the number of migrations per square kilometre to 


each ring. 


The variables were tabulated for each time period (Tables A-1 
to A-4) and a general decrease of migration with distance was observed. 
Following identification of the inverse-distance relationship, the 
association of the two variables was expressed mathematically by fitting 


Pareto functions to the data. 
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* 
TABLE A.1 - OBSERVED LOCAL MIGRATION IN THE PEACE DISTRICT, 1921-1931 


Ring Area 
g Number of 


Distance from Origin 


Kilometres) (Kilometres) 

314 16 0.05095 5 

942 4 0.00425 15 
1884 6 0.00318 25, 
3140 Ml 0.00032 35 
4710 4 0.00085 45 
6594 2 0.00030 5 
Sune ¢) 0.00000 65 
11304 iL 0.00009 75 


TABLE A.2 - OBSERVED LOCAL MIGRATION IN THE PEACE DISTRICT, 1931-1941" 


Ring Area 


Distance from Origin 


£ Mi t “ 
‘a ee (e@) Rubee; (Cee 


Kilometre (Kilometres) 
10828 3 
01168 a 
00371 oe 
00064 33 
00127 ca 
00030 2 
00011 ae 
00044 eh 
00007 fe 
00006 2° 


eae eo eee 
Kilometres) i 
314 34 6) 
942 1l Ow 
1884 if, Or 
3140 2 O. 
4710 6 m0 
6594 2 OF 
S792 dA 0% 
11304 5 ie 
14129 1 Or 
L270 i QO. 
eres: Sample survey by 


the author 1969. 
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TABLE A.3 - OBSERVED LOCAL MIGRATION IN THE PEACE DISTRICT, 1941-1951" 


Ring Area 


Distance from Origin 


(Square ene at Vee oe oe to Ring Centre 
iemetees) grants per Sguare Kilometre (ei lometces) 
314 24 0.07643 5 
942 19 0702017 15 
1884 7 0.01423 ERS) 
3140 6 O200121 35 
4710 0) 0.00000 45 
6594 6 0.00091 55 
oT 2 4 0.00045 65 
11304 4 0.00035 ie 
14130 2, 0.00014 85 
17270 6) 0.00000 95 
20724 2 0.00009 105 


TABLE A.4 — OBSERVED LOCAL MIGRATION IN THE PEACE DISTRICT, 1951-1961* 


Ring Area 


Savere ames 
Kilometres) 
314 25 
942 13 
1884 12 
3140 5 
4710 6 
6594 4 
S192 3 
11304 9 
14130 5 
* 
DOUMmGes 


Number of Migrants 
per Square Kilometre 


eS SS ©& oS oS f€& O& & © 


Sample survey by 


507962 
-01380 
-00637 
200159 
aOOL Zi 
-00061 
-00034 
- 00080 
-00035 


the author, 1969. 


Distance from Origin 
to Ring Centre 
(Kilometres) 


5 
NES) 
25 
35 
45 
she) 
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where M is the number of migrants per square kilometre, D is the 
distance variable, and c and b are constants to be estimated. The 
function should be fitted to data which exhibit a linear relationship. 
To make this assumption more reasonable, the variables were transformed 
to Vlog M “and (log D ~ The function was, fitted by the method of least 


squares. The least squares line has the form, 


where Y is the dependent variable, X is the independent variable, 


a is the slope, and ay is the intercept. Knowing, 
fo) 


and 


Log i= leg ic = blog D 


assume that Y= log M and X log D , then 


LGi7 


where ree logec. and ea -~b . These constants can be estimated by 


the procedure suggested by Spiegel (1961, p. 220), 


(Jy) (ix?) - (Gy) Oxy) 


(e) Nyx" -~ (Sx) 7 


and 


Nyxy = ()x) (Jy) 
eg denies aso i 


iL Nyx? Se (Sx)? 


where N is the number of observations. By this method, the following 


equations were determined, 


19210] 1931 mee. 
1931 - 1941 M = 1,047 7°78 
ev, Se ER Meee. e7 ou Dekon 
1951 - 1961 M = 0.69180 °"7° 


The values of c and b were fed directly into the Monte 
Carlo programme to determine the raw probability of interaction between 


req Cn meranderegion aj. this is > giveniby 


where c and b are the derived constants, A is the area of the 
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destination region, and D is the distance between the centres of the 
two regions. For example, the raw probability of contact in time period 


one (1921-1931) between region one and region two (see Table 4.6) is 


5.370 (3339) 


elesie oO 


= 0.1800 


where 5.370 and 2.80 are the c and b values respectively of the 
Pareto expression, 3339 is the area (sq. kilometres) of region two, 
and 61.474 is the distance (kilometres) between the centres of regions 


one and two. 


Raw probabilities were then smoothed and adjusted by population 
to derive the final probability of migration between region i and 


Begion™® “j=: 


The Pareto function extends from zero to infinity. Consequently, 
in this thesis, the distances put into the function are greater than 


Zero, sana eSsetianvincinicy i MorriLLeloGsa epee ol). hus pit 
M = p(d) 
then 


=e) 
p(d) = cD , Om Di er 


APPENDIX B 


MARKOV CHAINS 


A simple numerical example of a regular Markov chain is 


provided to illustrate the comments made in Chapter Three. Assume an 


initial probability vector 69) = CS, =) , where the probability of 


phenomena being distributed in state one is 3/4 , and in state two 


is 1/4 . The transition probabilities are given by 
ne 
Dis 2 2 
og 
4 4 


How will the phenomena be distributed after one step, knowing the 
initial distribution and the transition probabilities? The probability 
of phenomena being in state one after one step is the sum of the 
probabilities of being in each of the possible states at the previous 


step, and then moving to state one on this step. Thus, 


= te 5 
ai 1 ae 2e 
where a is the probability of phenomena being distributed in state 
(0) (0) wre 
one aiter one Step, Py and Py are the probabilities of phenomena 


being initially distributed in state one and state two, respectively, 


| 


Pil Sethe transition PLoOpabDliicy OL moving Lrom State one Co state one, 


and Poy the probability of moving from state two to state one. Equally, 
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uf : nat : : 
where Ps is the probability of phenomena being in state two after 
one step, and Pio and Po, are the transition probabilities of moving 
from state one to state two, and from state two to state two, respectively. 


In this example, 


Q).3,1,1.3 
“a Ded sA 
ad 
@le 
and 
2) a Ue 
AL ee ea 
ze alle 
16 
In practice, it can be shown that 
(n) (n-1) 
ie de a 
where p”) represents the probable distribution of phenomena after 
fee steps, pate! the probable distribution after n- 1 steps, and 


P is the matrix of transition probabilities. In this example, 


hgh 


a Fi 
(1) ep val 2 2 
pt ge E ih 
ee 
2, See 
(76 ie? 


where the probability of phenomena being in state one after one time 


MerlOis1 suo, lOenend 3nictacestworrs) 7/16... fqually, 


y\?) ane) Pp 


=< p e 
1 ok 
Poet aM 2 2 
16 16 Seer 
4 4 

64 64 


where the probability of being in states one and two after two steps is 
39764 “and 25/04 , respectively. sin all cases, >) iS a probability 


vector which sums to unity. Also, if 


then 


where 


PS We! oe le Spe Ve inhale bo 


factors) 


Thus, 


where 


po = pas P 
In the above example, 

2 ee 
(2) aes al 8 8 
ea 4) Slo) 
UG 16 

Ge Bs 

64 64 


This is the same answer derived by expression 


cy pe 


Matrix multiplication enables successive state of the system 


to be computed rapidly. 


APPENDIX C 


TABLES OF POPULATION STATISTICS BY REGIONS, 1921 to 1961 


TABLE C.1 - REGIONAL POPULATION FIGURES, 1921-1931 


Expected Total Expected External 


Natural 


en Population Births Deaths meas MigrationGain - Migration 
1921 R92 lel O3 ee 21) 931. 1921-1931 or Loss u Gain or Loss 
I) ALLS) Shab MAAS) ShAk 
1 0 0 0 oo 24 24 
2 148 48 13 35 292 373 
3 2 0 0 0 23° 24 
4 104 34 g 2S 285 PASE, 
5 IL 3AZ 433 124 SOE) Eo IL ALEKS) 
6 Udi 25 i 18 408 522 
7 0 0 0 0 24 | 24 
8 Ye 23 6 ale 52 66 
€ TLS6 Sy) 108 27 i 1882 2097 
10 0) 6) 0 O iL@ 10 
ind ash 82 DS ie! 796 1018 
2 283 404 A bes ts 289 IL Bil a eae 
TS 4030 iheyal BUS 943 2160 2003 
14 852 219 Ue 200 974 1246 
ds) PTE 416 BIS 297) 2009 1686 
16 354 eG 33 83 948 970 
Ei 430 140 40 100 180 192 
18 1260 413 118 BAS JS) 858 920 
a2) 290 o5 27 68 1330 Logs 
20 0) 0) 6) O 32 348) 
Eile 37 438 US) Sy @) 477 542 
Ze 1456 477 136 341 Tad 266 
23 46 is) 4 dull 1468 LO, 
24 Bos ley, BS 84 abate?) 954 
20 6) O 0 @) 24 24 
26 6) 6) O 6) Su: OS 


Source: Census of Canada and Vital Statistics reports of Alberta 


“calculated from data derived by a sample. 
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TABLE C.2 - REGIONAL POPULATION FIGURES, 1931-1941 


ose ExpectedTotal Expected External 


foctba Population Births Deaths rises Migration Gain Migration 
ie 31-1941 1931-4 jo31-1041 OF Loss Gain or Hees 
1931-1941 7931-1941 
uf 24 il O a 3 3 
2 A475 172 Zu. 145 226 6) 
eS 25 2 1 iL 67 81 
4 414 169 28 141 HGS O 
5 2823 707 180 S20 =192 =222 
6 503 101 23 78 20 9 
7 24 4 > ~] 91 iQ Mt 
8 141 26 10 16 47 O 
o 3509 19092 245 847 PAS coils b 
10 10 6) 6) 0 Lo ale 
13h 1106 298 66 232 =39 -108 
42 1341 424 135 239 =lid, exes) 
13 11335 1674 450 1224 =L089 -424 
14 2026 410 120 Zo) -204 —4197, 
LS 5 1065 295 iO -602 oo fake! 
16 1385 320 70 25] 157, 459 
7 710 252 42 210 130 164 
18 2413 856 206 650 —413 -494 
19 1688 289 67 222 136 146 
20 32 0) 0) 6) 6 s) 
yah 24127 506 165 341 0) -149 
22 2524 674 184 490 = 6) -266 
23 L520 528 90 438 106 187 
24 1562 366 68 298 =a -47 
25 24 5 ik 4 52 64 
26 £51 29 3 22 184 225 


Source: Census of Canada and Vital Statistics Reports of Alberta 


*calculated from data derived by a sample. 
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TABLE C.3 - REGIONAL POPULATION FIGURES, 1941-1951 


——_____..____— = = ee 


Expected Total Expected External 


oie Population Births Deaths —oiibieste Migration Gain Migration 
1941 I94}-1951 1941-1951 1o41-p51 °F Loss x Gain or Loess 
194151 1941-1951 
A 28 8 3 5 5) =) 
2 846 oe7 69 268 103 0 
3 93 14 4 10 -48 le 
4 730 259 50 189 =68 =103 
5 3 159) 1040 250 120 ite 24 
6 610 215 50 165 AQ 198 
i) 114 22 10 1g -A3 =67 
8 204 58 14 44 -46 72 
9 3874 1356 S25 1033 47 63 
10 0 0 0 0) 10 13 
shah 1249 399 98 301 a2 99 
12 L453 52a 141 386 356 0) 
73 7268 2468 bo? 1876 SEES mle a 
14 212 605 144 461 SES) -1465 
iS 3745 1218 316 902 aS, -682 
16 VIS) 5o2 £28 424 ~640 =a62 
iby 1050 B61 Ue) 282 OE 1024 
18 2650 942 226 716 ARN) =359 
19 2046 582 35 447 oy 3 -494 
20 38 7 3 4 au 85 
au 2378 769 198 SWS =306 —239 
ae. 2758 879 229 650 -448 -444 
Be 2069 632 134 498 =O, S202 
24 1844 568 136 432 -418 0 
25 80 18 4 14 3 4 
26 357 101 24 77 ats ic 


 —— 


Source: Census of Canada and Vital Statistics Reports of Alberta 


* 
Calculated from data derived by a sample. 


- 
ef 2 rT Ad oe \nnigie 


rv Lato t cet 


ae 
2 


Lbel 


a 


a 
To ngs 
a 


{ 
SS 


_ 


e 


. 


we 


= 


a 
Ls b x! 
. a3 4 


~ 


7 9 7 
' © & 


c 


176 


TABLE C.4 - REGIONAL POPULATION FIGURES, 1951-1961 


Neal Expected Total Expected External 


ashen Population Births Deaths pees an Migration Gain | Migration 
ie oh NEM EUS eH aH eIoae 195-1961 or Loss . Gain or noes 
1951-1961 dbs Voy RAE) oy 
il 28 Ike Ds 10 90 76 
2 Lau? 560 dhag@) 450 855 600 
3 55 25 4 21 == 15 aot Be} 
4 Sod Bol qi 314 ehh 0 
Ss 4061 1870 367 1505 eye) colbeyeyt E 
6 815 375 3 302 vale /2) eic5 
wv 90 4l 8 he) 4 4 
8 202 93 18 15) 108 90 
2 4954 2281 448 1833 =i 2 a WTR 
10 10 4 0 4 rae =L3 
ak 1662 USS 150 615 neh) Sey 
2 2195 1011 198 813 698 587 
3 8229 3790 744 3046 2274 19 i6 
14 1614 743 146 If 5620 =A16)/46} 
BES) 3634 1673 328 1345 SaoG ATS 
16 Leon he! 726 142 584 =f EK) =560 
Thy 1609 741 145 596 -439 =365 
18 32595 1489 292 JE5/ -845 -444 
NES} PAL PAB) 976 IRS) 785 S22 -1648 
20 39 A5 8 SV ey iusy b 
21 2593 1194 234 960 Syl -284 
22 2960 1363 267 1096 -633 eA ey 
23 2300 1059 208 851 Sethe) =1307, 
24 1858 855 168 687 SES) Sak 
25 97 44 8 36 -49 -42 
26 415 ol ey) 154 167 _ 0 


Source: Census of Canada and Vital Statistics Reports of Alberta 


* 
Calculated from data derived by a sample. 
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APPENDIX D 


THE SAMPLE QUESTIONNAIRE 


A 98% response to a questionnaire on population migration was 
achieved. Interviews were carried out over the telephone. The high 
degree of success by the telephone method negated the necessity of 
sending out further questionnaires to be completed by mail. The author 
believes that a questionnaire administered by telephone is extremely 
efficient if the circumstances are correct. Firstly, all respondents 
were sent a covering letter ne a list of the questions to be asked. 

The questions could be asked in a simple direct fashion, and topics of 
income and age were omitted as they were not considered necessary in 
this study. The people of this largely rural based area are generally 
well aware of its recent history, and many had been among the original 
settlers. Most were more than eager to co-operate and speak of their 
parent's or their own role in settling this region. In fact, the 
greatest difficulty encountered was the necessity to break off telephone 
calls, which were becoming too lengthy. That this technique would work 
in urban areas is doubtful, since urban residents are usually more aware 
of the canvassing telephone calls made by salesmen. If the topic had 
been less interesting to the people of the Peace District, and had the 
questions been more personal, then the response might not have been so 
high. The success of the method suggests that geographers might make 


more use of the telephone to carry out short, non-controversial interviews. 
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Department of Geography The University of Alberta 
Edmonton, Canada 


August, 1969 


Dear Sir or Madam: 


My name is Ian Lindsay. I am writing a Master's Thesis at the 
University of Alberta in the Geography Department, and am studying the 
movement of settlers in the Peace River District. 


I have selected your name at random and would like to ask you 
several questions about the places where you and your parents have lived. 


Unfortunately, I will not be able to meet everyone in person, 
so I will phone you in the next few weeks to obtain your answers to the 
enclosed questions. I hope this will not inconvenience you. 


This telephone conversation will take only a few minutes and will 
deal only with where you and your family has lived. All information 
provided will be completely confidential. 


Without your assistance, I can not carry out my study of the 
history of settlement in the Peace River District. I would be grateful 
for any additional comments you may wish to make. 


Sincerely, 


Ian Lindsay 

Graduate Teaching Assistant 
Department of Geography 
University of Alberta 
Edmonton, Alberta 

Phone: 432-4158 
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PEACE RIVER SETTLEMENT QUESTIONNAIRE NO. 1 


Number: Respondent: 


These questions largely refer to places and dates. Could you 


please be as precise as possible, especially when referring to the 
Peace River District. In the latter case, give township, range, and 
quarter location for rural addresses. 


Section 1s) To be £filledsin by head of household 


B. 


Where and when was your father born? 
Place: 


Date: 


Where did your father live before his marriage? 
Give dates for each address. 


When were your parents married?. 


Date: 


Where did your father live after his marriage? Include his present 
address if still alive, and his addresses even if his spouse has died. 
Give dates for each address. 


If your father is dead when did he die? 


Date: 
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Where and when was your mother born? 
Place: 


Date: 


Where did she live before she was married? Give dates for each 
address. 


Where did she live after her marriage? Include her present address 
if still alive, and her addresses even if her spouse has died. Give 
dates for each address. 


If your mother is dead, when did she die? 


Date: 


Where and when were you born? 
Place: 


Date: 


Where did you live before your marriage? If not married, where have 
you lived? Give dates for each address. 


When were you married (if applicable)? 
Date: 


Where have you lived since your marriage (if applicable)? Give 
Gates for each address. 
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14. I£ your spouse is dead when did he or she die? 


Date: 


Section II: To be filled in by the spouse (if any) of the head of the 


household 
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